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Preface 


In  this  book,  you’ll  find  an  introduction  into  two  key  parts  of  physics:  mechanics  and  special  relativity.  The 
material  in  the  book  has  evolved  from  lecture  notes  on  courses  in  introductory  physics  and  relativity  I  have 
taught  at  TU  Delft  since  2012.  In  most  cases,  not  all  of  the  material  covered  in  the  book  were  discussed  in 
the  lectures.  In  particular  sections  indicated  with  a  star  are  extra  material,  for  those  interested  in  learning 
more.  In  the  first  part  on  classical  mechanics,  the  chapters  do  not  necessarily  be  taught  or  read  in  the  order 
I  have  presented  them.  Many  of  the  concepts  of  chapter  8  can  be  understood  based  on  the  material  covered 
in  chapters  -3.  In  the  second  part  we  discuss  the  special  theory  of  relativity,  for  which  especially  chapters  3 
on  energy  and  on  momentum  from  the  first  part  are  important.  There  are  thus  multiple  paths  you  can  take, 
and  I  encourage  you  to  look  ahead  sometimes  to  see  how  what  is  yet  to  come  ties  in  with  what  is  discussed  at 
a  given  point  in  the  book.  If  you  need  refresher  on  some  of  the  mathematical  techniques,  appendix  contains 
some  useful  background  maths.  Throughout,  I’ve  tried  to  alternate  theory  with  worked  examples,  to  give  you 
an  idea  about  what  you  can  actually  do  with  the  theory  just  developed. 

Students  often  ask  me  how  to  best  study  for  an  exam.  Here  are  three  key  steps  to  successfully  completing 
any  course  in  physics  (or  probably  any  field  of  study),  which  for  maximum  effect,  are  best  taken  throughout 
the  course: 

Prepare:  Read  the  assigned  sections  before  the  class,  look  at  the  problems  before  the  tutorial. 

Participate:  Attend  class,  join  in  any  quizzes  /  problem  sessions  offered,  ask  questions  whenever  you 
don’t  understand. 

Practice:  Make  all  problems,  both  in  practice  sessions  and  in  homework.  Try  yourself  before  an  exam 
by  doing  last  year’s  exam.  Resist  the  urge  to  look  up  the  answers. 

Like  learning  maths,  or  playing  the  piano,  learning  to  do  physics  takes  practice.  There  are  no  quick  fixes. 
This  especially  goes  for  doing  problems:  looking  up  the  solution  is  not  equivalent  to  finding  the  solution 
yourself!  Of  course  I  understand  you  want  to  check  your  answer,  which  is  fine  -  but  do  not  look  for  the  solution 
method,  as  you’ll  only  fool  yourself  into  thinking  you  understand.  You  only  really  understand  if  you’ve  found 
the  solution  yourself,  and  can  explain  to  someone  else  why  it  gives  the  right  answer.  The  numerical  value 
of  that  answer  is  of  little  interest  -  it’s  the  method  that  counts!  In  fact,  you  should  avoid  putting  in  numbers 
altogether  (only  do  so  in  the  final  step  if  a  number  is  asked  for),  as  a  symbolic  answer  will  tell  you  much 
more:  whether  the  dimensions  match,  whether  limit  cases  make  sense,  and  whether  the  answer  itself  can  be 
understood  from  scaling. 

Now  if  you  find  you  do  not  understand  something  -  no  worries,  that’s  what  teachers  are  for!  By  all  means, 
ask.  Of  course,  you  can  also  ask  another  student  -  you’d  actually  be  doing  them  a  favor,  as  by  explaining  it 
to  you,  they  get  better  understanding  themselves.  Moreover,  whenever  you  find  in  class  or  during  a  problem 
session  you  do  not  understand,  there  are  likely  others  who  don’t  understand  either  -  and  you’d  be  helping  the 
teachers  by  asking  a  question. 

My  goal,  as  a  teacher,  is  to  help  you  get  a  feeling  for  the  basic  principles  that  govern  the  physical  world 
both  at  the  everyday  level  and  in  the  exotic  world  of  relativity.  However,  I  cannot  do  this  without  your  active 
participation,  nor  without  your  feedback  in  the  form  of  questions.  I  expect  a  lot  from  you,  and  you  can  expect 
a  lot  from  me  too.  In  particular,  I  will  keep  working  on  improving  this  text,  and  so  any  questions  or  comments 
you  may  have  are  very  welcome  - 1  love  interaction!  So  please  don’t  be  shy,  but  by  all  means  let  me  know  what 
you  think  of  the  book,  and  whether  it  helped  you  understand  physics  better. 

HOW  TO  TACKLE  A  PROBLEM:  THE  IDEA  METHOD 

Americans  love  acronyms  (apparently  these  are  a  good  way  of  getting  things  stuck  in  people’s  head),  so  they 
call  the  three  key  steps  in  the  previous  section  the  ‘PPP’  method,  and  have  likewise  come  up  with  a  nice 
one  for  a  very  useful  way  to  tackle  a  physics  problem:  Identify,  Develop,  Evaluate,  and  Assess,  or  IDEA.  This 
method  is  especially  helpful  for  the  often  feared  first  step:  how  do  you  translate  a  problem  into  a  set  of  equa¬ 
tions  that  you  can  solve?  The  first  thing  to  do  is  usually  quite  simple:  identify  which  kind  of  problem  you’re 
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dealing  with.  You  may  be  asked  about  a  minimal  force  to  keep  things  stable,  the  amount  of  work  necessary 
to  perform  a  certain  task,  or  the  velocities  of  two  billiard  balls  after  a  collision.  We’ll  cover  such  cases  in  the 
worked  examples  and  problems,  where  the  identification  step  has  already  been  taken  (it’s  the  topic  under 
discussion)  -  but  in  the  end  you  should  be  able  to  classify  the  problem  yourself.  There  is  a  huge  bonus  to  that 
classification,  as  it  immediately  tells  you  which  laws  apply  -  see  the  mind-map  in  figure  .  I  encourage  you 
to  make  a  similar  scheme  for  both  the  mechanics  and  the  relativity  part  of  the  course  for  yourself  -  it’s  a  great 
way  of  summarizing  the  material. 
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Figure  1:  (Simplified)  schematic  overview  of  a  classification  of  mechanics  problems,  with  the  relevant  laws  indicated. 

Once  you  have  identified  the  type  of  problem,  you  can  go  ahead  and  develop  it.  First,  make  a  sketch  of 
the  situation,  and  collect  all  necessary  information,  which  you  can  put  in  the  sketch  (for  example  drawing  all 
relevant  forces  in  a  free  body  diagram,  see  section  2.  ).  Second,  write  down  the  relevant  laws  in  the  form  that 
applies  to  the  case  at  hand.  In  some  cases,  this  is  simply  one  equation,  while  in  others,  you  may  end  up  with  a 
system  of  equations  -  so  the  evaluation  step  can  be  quite  complicated.  In  either  case,  you  will  find  your  sketch 
helpful  in  identifying  all  relevant  laws,  and  thus  equations.  Now  that  you  have  developed  the  problem,  the 
next  step  is  to  evaluate  it,  i.e.,  solve  the  equations  for  the  quantity  asked  for  in  the  problem.  To  simplify  the 
evaluation,  it  is  often  easier  not  to  plug  in  any  numbers  -  except  of  course  when  the  numbers  are  very  simple 
or  hugely  simplify  your  system  (such  as  crossing  out  terms  that  are  zero).  Exactly  because  the  evaluation  step 
can  be  tricky,  it  is  always  a  good  idea  to  assess  your  answer.  Does  the  number  you  get  in  the  end  make  sense? 
Does  your  answer  have  the  right  dimensions  (see  section  1.2)?  Does  it  behave  in  the  way  you’d  expect  if  you 
take  the  value  of  one  of  the  parameters  to  zero,  or  infinity?  We’ll  practice  with  this  explicitly  in  the  problems, 
and  I  encourage  you  to  keep  on  doing  it  -  it  will  save  you  from  some  embarrassing  mishaps  in  the  end. 
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Heros  of  physics 

All  of  the  material  discussed  in  this  book  is  classical  physics  -  not  just  the  mechanics  as  developed  in  the 
17th  century,  but  also  the  special  relativity  of  the  early  20th  century  (now  well  over  100  years  ago) .  Of  course, 
physics,  or  rather  the  physicists,  have  not  been  idle  since,  and  many  new  physical  principles  have  been  dis¬ 
covered  since  Einstein,  but  the  ones  discussed  here  remain  valid.  In  fact,  they  remain  extremely  useful,  not 
only  in  many  applications  (including  engineering,  architecture,  and  spaceflight),  but  also  in  ongoing  physics 
research.  For  example,  in  my  own  field  of  biophysics,  a  key  process  is  cell  division,  in  which  the  pulling  apart 
of  the  two  copies  of  the  DNA  is  a  crucial  step.  Forces  generated  by  small  molecular  motors  are  central  players 
in  that  process,  and  they  obey  Newton’s  laws  just  like  the  (probably  apocryphal)  apple  falling  on  Newton’s 
head  did. 

In  doing  research  or  applications  today,  we  are,  in  Newton’s  words,  standing  on  the  shoulders  of  the  gi¬ 
ants  who  came  before  us.  I’ve  added  the  stories  of  some  of  these  giants  throughout  the  book,  though  with 
a  somewhat  double  feeling.  On  the  one  hand,  these  giants  definitely  deserve  the  credit  for  the  work  they’ve 
done,  and  some  of  them  were  examples  of  dedication  and  diligence.  On  the  other  hand,  the  list  of  people  can 
never  be  inclusive  (there  are  many  others  who  contributed  as  well),  and  the  list  has  the  distinct  disadvantage 
of  being  very  un- diverse:  these  people  are  all  white,  almost  all  men,  and  almost  all  from  Europe.  The  reason 
for  this  limited  variety  of  course  is  that  up  until  the  early  20th  century,  only  white  European  men  had  any 
chance  of  being  in  a  position  privileged  enough  to  be  able  to  dedicate  a  significant  amount  of  their  time  to 
research.  That  is  not  to  say  that  they  never  met  trouble  (both  Einstein  and  Noether  had  to  flee  Germany  when 
the  Nazi’s  seized  power  in  1933),  but  that  others  simply  never  got  the  chance.  Fortunately,  although  we’re  still 
far  from  a  perfect  world,  things  on  that  front  have  improved  tremendously,  and  so  please  take  these  people 
for  what  they  were:  dedicated,  curious  people  who  were  interested  in  finding  out  how  the  world  works.  I  hope 
the  same  goes  for  you,  and  that  with  this  book,  I  can  help  you  a  bit  on  the  path  towards  becoming  a  physicist 
yourself. 


Timon  Idema 
Delft,  September  2018 
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1 

Introduction  to  classical  mechanics 


Classical  mechanics  is  the  study  of  the  motion  of  bodies  under  the  action  of  physical  forces.  A  force  is  any 
influence  that  can  cause  an  object  to  change  its  velocity.  The  object  can  be  anything  from  an  elementary 
particle  to  a  galaxy.  Of  course  anything  larger  than  an  elementary  particle  is  ultimately  a  composite  of  el¬ 
ementary  particles,  but  fortunately  we  usually  don’t  have  to  consider  all  those,  and  can  coarse-grain  to  the 
scale  of  the  objects  at  hand.  As  is  true  for  any  physical  model,  classical  mechanics  is  an  approximation  and 
has  its  limits  -  it  breaks  down  at  very  small  scales,  high  speeds  and  large  gravitational  fields  -  but  within  its 
range  of  applicability  (which  includes  pretty  much  every  single  phenomenon  in  everyday  life)  it  is  extremely 
useful. 

Classical  mechanics  is  based  on  a  small  number  of  physical  laws ,  which  are  mathematical  formulations 
of  a  physical  observation.  Some  laws  can  be  derived  from  others,  but  you  cannot  derive  all  of  them  from 
scratch.  Some  laws  are  axioms,  and  we’ll  assume  they  are  valid.  The  laws  we’ll  encounter  can  be  divided  up  in 
three  classes:  Newton’s  laws  of  motion,  conservation  laws  and  force  laws.  As  we’ll  see,  the  three  conservation 
laws  of  classical  mechanics  (of  energy,  momentum  and  angular  momentum)  can  be  derived  from  Newton’s 
second  and  third  laws  of  motion,  as  can  Newton’s  first  law.  The  force  laws  give  us  the  force  exerted  by  a 
certain  physical  system  -  a  compressed  spring  (Hooke’s  law)  or  two  charged  particles  (Coulomb’s  law)  for 
example.  These  also  feed  back  into  Newton’s  laws  of  motion,  although  they  cannot  be  derived  from  these  and 
are  axioms  by  themselves. 

In  addition  to  the  physical  laws,  there  is  a  large  number  of  definitions  -  which  should  not  be  confused  with 
the  laws.  Definitions  are  merely  convenient  choices.  A  good  example  is  the  definition  of  the  number  n:  half 
the  ratio  of  the  circumference  to  the  radius  of  a  circle.  As  you  have  no  doubt  noticed,  it  is  very  convenient  that 
this  number  has  gotten  its  own  symbol  that  is  universally  recognized,  as  it  pops  up  pretty  much  everywhere. 
However,  there  is  no  axiom  here,  as  we  are  simply  taking  a  ratio  and  giving  it  a  name. 

1.1.  Dimensions  and  units 

In  physics  in  general,  we  are  interested  in  relating  different  physical  quantities  to  one  another  -  we  want  to 
answer  questions  like  ‘how  much  work  do  I  need  to  do  to  get  this  box  up  to  the  third  floor’?  In  order  to  be 
able  to  give  an  answer,  we  need  certain  measurable  quantities  as  input  -  in  the  present  case,  the  mass  of  the 
box  and  the  height  of  a  floor.  Then,  using  our  laws  of  physics,  we  will  be  able  to  produce  another  measurable 
quantity  as  our  answer  -  here  the  amount  of  work  needed.  Of  course,  you  could  check  this  answer,  and  thus 
validate  our  physical  model  of  reality,  by  measuring  the  quantity  in  question. 

Measurable  (or  ‘physical’,  or  ‘observational’)  quantities  aren’t  just  numbers  -  the  fact  that  they  correspond 
to  something  physical  matters,  and  10  seconds  is  something  very  different  from  10  meters,  or  10  kilograms. 
The  term  we  use  to  express  this  is,  rather  unfortunately,  to  say  that  physical  quantities  have  a  dimension  -  not 
to  be  confused  with  length,  height  and  width.  Anything  that  has  a  dimension  can  be  measured,  and  to  do 
so  we  use  units  -  though  there  may  be  different  units  in  which  we  measure  the  same  quantity,  such  as  cen¬ 
timeters  and  inches  for  length.  When  measuring  the  same  quantity  in  different  units,  you  can  always  convert 

1 A I  the  time  of  writing,  the  unit  of  mass  is  still  determined  using  a  prototype  in  Paris,  however,  a  redefined  unit  based  on  the  value  of 
Planck’s  constant  is  expected  to  be  adopted  on  May  20,  2019. 
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quantity 

symbol 

unit 

symbol 

based  on 

length 

L 

meter 

m 

speed  of  light 

time 

T 

second 

s 

caesium  atom  oscillation 

mass 

M 

kilogram 

kg 

Planck’s  constant 

current 

I 

Ampere 

A 

electron  charge 

temperature 

T 

Kelvin 

K 

Boltzmann’s  constant 

luminosity 

J 

candela 

cd 

monochromatic  radiation 

particle  count 

N 

mole 

mol 

Avogadro’s  constant 

Table  1.1:  Overview  of  the  SI  quantities  and  units,  and  the  physical  constants  they  are  (or  are  proposed  to  be)  based  on. 


Figure  1.1:  A  harmonic  oscillator:  a  mass  m  suspended  on  a  spring  with  spring  constant  k,  oscillating  with  a  frequency  id. 


between  them  -  there  are  2.54  centimeters  in  an  inch  -  but  it’s  meaningless  to  try  to  convert  centimeters  into 
seconds,  because  length  and  time  are  different  quantities  -  they  have  different  dimensions. 

We  will  encounter  only  three  different  basic  quantities,  which  have  the  dimensions  of  length  (L),  time 
(T),  and  mass  (M).  Thanks  to  the  Napoleonic  conquest  of  Europe  in  the  early  1800s,  we  have  a  basic  unit 
for  each  of  these:  meters  (m)  for  length,  seconds  (s)  for  time,  and  kilograms  (kg)  for  mass.  Although  we 
won’t  encounter  them  here,  the  standard  system  of  units  (called  the  Systeme  International,  or  SI)  has  four 
more  of  these  basic  pairs:  (electric)  current  I,  measured  in  Amperes  (A),  temperature  T,  measured  in  Kelvin 
(K),  luminosity  /,  measured  in  candelas  (cd),  and  ‘amount  of  stuff’,  measured  in  moles  (mol),  see  table 
Unfortunately,  although  this  system  is  commonly  used  in  (continental)  Europe  and  in  many  other  parts  of 
the  world,  it  is  not  everywhere,  notably  in  the  US,  where  people  persist  in  using  such  things  as  inches  and 
pounds,  so  you’ll  often  have  to  convert  between  units. 

From  the  seven  basic  quantities  in  the  SI,  all  others  can  be  derived.  For  example,  speed  is  defined  as  the 
distance  traveled  (length)  divided  by  the  time  it  took,  so  speed  has  the  dimension  of  LI  T  and  is  measured  in 
units  of  m/s.  Note  that  in  order  to  be  able  to  compare  two  quantities,  they  must  have  the  same  dimension. 
This  simple  observation  has  an  important  consequence:  in  any  physics  equation,  the  dimensions  on  both 
sides  of  the  equality  sign  always  have  to  be  the  same.  There’s  no  bargaining  on  this  point:  equating  two 
quantities  with  different  dimensions  does  not  make  any  kind  of  sense,  so  if  you  find  that  that’s  what  you’re 
doing  at  any  point,  backtrack  and  find  where  things  went  wrong. 

1.2.  Dimensional  analysis 

Although  you  will  of  course  need  a  complete  physical  model  (represented  as  a  set  of  mathematical  equations) 
to  fully  describe  a  physical  system,  you  can  get  surprisingly  far  with  a  simple  method  that  requires  no  detailed 
knowledge  at  all.  This  method  is  known  as  dimensional  analysis,  and  based  on  the  observation  in  the  previ¬ 
ous  section  that  the  two  sides  of  any  physical  equation  have  to  have  the  same  dimension.  You  can  use  this 
principle  to  qualitatively  understand  a  system,  and  make  predictions  on  how  it  will  respond  quantitatively 
if  you  change  some  parameter.  To  understand  how  dimensional  analysis  works,  an  example  is  probably  the 
most  effective  -  we’ll  take  one  that  is  ubiquitous  in  classical  mechanics:  a  mass  oscillating  on  a  spring  (known 
as  the  harmonic  oscillator ),  see  figure 

1.2.1.  Worked  example:  Dimensional  analysis  of  the  harmonic  oscillator 

Consider  the  harmonic  oscillator  consisting  of  a  mass  of  magnitude  m,  suspended  on  a  spring  with  spring 
constant  1c.  If  you  pull  down  the  mass  a  bit  and  release,  it  will  oscillate  with  a  frequency  oj.  Can  we  predict 
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how  this  frequency  will  change  if  we  double  the  mass? 

There  are  two  ways  to  answer  this  question.  One  is  to  consider  all  the  forces  acting  on  the  mass,  then 
use  Newton’s  second  law  to  derive  a  differential  equation  (known  as  the  equation  of  motion)  for  the  mass, 
solve  it,  and  from  the  solution  determine  what  happens  if  we  change  the  mass.  The  second  is  to  consider  the 
dimensions  of  the  quantities  involved.  We  have  a  mass,  which  has  dimension  of  mass  (M),  as  it  is  one  of  our 
basic  quantities.  We  have  a  spring  with  spring  constant  k,  which  has  dimensions  of  force  per  unit  length,  or 
mass  per  unit  time  squared: 

[, k]  =  FIL=  MLT~2IL=  MIT 2.  (1.1) 

Note  the  notation  [ k\  for  the  dimension  of  k.  For  the  frequency,  we  have  [o>]  =  1  IT.  Now  we  know  that  the 
frequency  is  a  function  of  the  spring  constant  and  the  mass,  and  that  both  sides  of  that  equation  must  have 
the  same  sign.  Since  there  is  no  mass  in  the  dimension  of  the  frequency,  but  it  exists  in  the  dimension  of 
both  the  spring  constant  and  the  mass,  we  know  that  (>)  must  depend  on  the  ratio  of  k  and  m:  a)  ~  k/m.  Now 
[k/m]  —  \IT2,  and  from  [w]  =  1/7',  we  conclude  that  we  must  have 

a)  ~  \fkhn.  (1.2) 

Equation  (  )  allows  us  to  answer  our  question  immediately:  if  we  double  the  mass,  the  frequency  will  de¬ 

crease  by  a  factor  \[2. 

Note  that  in  equation  (  .2)  I  did  not  write  an  equals  sign,  but  a  ‘scales  as’  sign  (~,  sometimes  also  written 
as  ex).  That  is  because  dimensional  analysis  will  not  tell  us  about  any  numerical  factor  that  may  appear  in 
the  expression,  as  those  numerical  factors  have  no  unit  (or,  more  correctly,  have  no  dimension  -  they  are 
dimensionless ). 

You  may  object  that  there  might  be  another  factor  at  play:  shouldn’t  gravity  matter?  The  answer  is  no,  as 
we  can  also  quickly  see  from  dimensional  analysis.  The  force  of  gravity  is  given  by  mg,  introducing  another 
parameter  g  (the  gravitational  acceleration)  with  dimension  [g]  -  LIT2.  Now  if  the  frequency  were  to  depend 
on  g,  there  has  to  be  another  factor  to  cancel  the  dependence  on  the  length,  as  the  frequency  itself  is  length- 
independent.  Neither  m  nor  Ic  has  a  length-dependence  in  its  dimension,  and  so  they  cannot  ‘kill’  the  L  in  the 
dimension  of  g;  the  frequency  therefore  also  cannot  depend  on  g  -  which  we  have  now  figured  out  without 
invoking  any  (differential)  equations! 

Above,  I’ve  sketched  how  you  can  use  dimensional  analysis  to  arrive  at  a  physical  scaling  relation  through 
inspection:  we’ve  combined  the  various  factors  to  arrive  at  the  right  dimension.  Such  combinations  are  not 
always  that  easy  to  see,  and  in  any  case,  you  may  wonder  if  you’ve  correctly  spotted  them  all.  Fortunately, 
there  is  a  more  robust  method,  that  we  can  also  use  to  once  again  show  that  the  frequency  is  independent 
of  the  gravitational  acceleration.  Suppose  that  in  general  co  could  depend  on  k,  m  and  g.  The  functional 
dependence  can  then  be  written  as 

M  =  [kamPgr]  =  (M/r2)“M^(L/r2)r  =  Ma+pT~2(a+r)Lr,  (1.3) 

which  leads  to  three  equations  for  the  exponents: 


a  +  /3  —  0, 

-2  (a  +  y)  =  -1, 
y  =  0, 

which  you  can  easily  solve  to  find  a  =  1/2,  p  -  -1/2,  y  =  0,  which  gives  us  equation  (  ).  This  method  will 

allow  you  to  get  dimensional  relations  in  surprisingly  many  different  cases,  and  is  used  by  most  physicist  as  a 
first  line  of  attack  when  they  first  encounter  an  unknown  system. 


2  The  actual  function  may  of  course  contain  multiple  terms  which  are  summed,  but  all  those  must  have  the  same  dimension.  Operators 
like  sines  and  exponentials  must  be  dimensionless,  as  there  are  no  dimensions  of  the  form  sin(M)  or  e!.  The  only  allowable  dimensional 
dependencies  are  thus  power  laws. 

'The  method  is  sometimes  referred  to  as  the  Rayleigh  algorithm,  after  John  William  Strutt,  Lord  Rayleigh  (1842-1919),  who  applied  it, 
among  other  things,  to  light  scattering  in  the  air.  The  result  of  Rayleigh’s  analysis  can  be  used  to  explain  why  the  sky  is  blue. 
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1.3.  Problems 

1.1  Harmonic  oscillator  revisited  Suppose  you  have  a  small  object  of  mass  m,  which  you  attach  to  a  spring 
of  spring  constant  1c  (which  itself  is  fixed  to  a  wall  at  its  other  end,  figure  ).  Above,  we  derived  an 
expression  for  the  frequency  of  oscillation  of  the  mass.  We  also  argued  that  it  should  be  the  same  for  both 
a  horizontally-positioned  and  a  vertically-positioned  oscillator,  i.e.,  that  the  frequency  is  independent  of 
the  gravitational  acceleration  g. 

(a)  Show  that  the  frequency  of  oscillation  is  also  independent  of  its  amplitude  A  (the  maximum  distance 
from  the  equilibrium  position  the  oscillating  mass  reaches). 

(b)  Use  dimensional  analysis  to  derive  an  expression  for  the  maximum  velocity  of  the  mass  during  the 
oscillation,  as  a  function  of  m,  k,  and  A. 

1.2  In  physics,  we  assume  that  quantities  like  the  speed  of  light  (c)  and  Newton’s  gravitational  constant  (G) 
have  the  same  value  throughout  the  universe,  and  are  therefore  known  as  physical  constants.  A  third 
such  constant  from  quantum  mechanics  is  Planck’s  constant  [h,  an  h  with  a  bar) .  In  high-energy  physics, 
people  deal  with  processes  that  occur  at  very  small  length  scales,  so  our  regular  Si-units  like  meters  and 
seconds  are  not  very  useful.  Instead,  we  can  combine  the  fundamental  physical  constants  into  different 
basis  values. 

(a)  Combine  c,  G  and  h  into  a  quantity  that  has  the  dimensions  of  length. 

(b)  Calculate  the  numerical  value  of  this  length  in  SI  units  (this  is  known  as  the  Planck  length).  You  can 
find  the  numerical  values  of  the  physical  constants  in  appendix  B. 

(c)  Similarly,  combine  c,  G  and  h  into  a  quantity  that  has  the  dimensions  of  energy  (indeed,  known  as 
the  Planck  energy)  and  calculate  its  numerical  value. 

1.3  Reynolds  numbers  Physicists  often  use  dimensionless  quantities  to  compare  the  magnitude  of  two  phys¬ 
ical  quantities.  Such  numbers  have  two  major  advantages  over  quantities  with  numbers.  First,  as  di¬ 
mensionless  quantities  carry  no  units,  it  does  not  matter  which  unit  system  you  use,  you’ll  always  get  the 
same  value.  Second,  by  comparing  quantities,  the  concepts  ‘big’  and  ‘small’  are  well-defined,  unlike  for 
quantities  with  a  dimension  (for  example,  a  distance  may  be  small  on  human  scales,  but  very  big  for  a 
bacterium) .  Perhaps  the  best  known  example  of  a  dimensionless  quantity  is  the  Reynolds  number  in  fluid 
mechanics,  which  compares  the  relative  magnitude  of  inertial  and  drag  forces  acting  on  a  moving  object: 

inertial  forces  p  vL 

Re  =  — - - - -  — ,  (1.4) 

drag  forces  q 

where  p  is  the  density  of  the  fluid  (either  a  liquid  or  a  gas),  v  the  speed  of  the  object,  L  its  size,  and  q  the 
viscosity  of  the  fluid.  Typical  values  of  the  viscosity  are  1.0  mPa  •  s  for  water,  50  mPa  •  s  for  ketchup,  and 
1.0  pPa-  s  for  air. 

(a)  Estimate  the  typical  Reynolds  number  for  a  duck  when  flying  and  when  swimming  (you  may  assume 
that  the  swimming  happens  entirely  submerged).  NB:  This  will  require  you  looking  up  or  making 
educated  guesses  about  some  properties  of  these  birds  in  motion.  In  either  case,  is  the  inertial  or  the 
drag  force  dominant? 

(b)  Estimate  the  typical  Reynolds  number  for  a  swimming  bacterium.  Again  indicate  which  force  is  dom¬ 
inant. 

(c)  Oil  tankers  that  want  to  make  port  in  Rotterdam  already  put  their  engines  in  reverse  hallway  across 
the  North  sea.  Explain  why  they  have  to  do  so. 

(d)  Express  the  Reynolds  number  for  the  flow  of  water  through  a  (circular)  pipe  as  a  function  of  the  ra¬ 
dius  R  of  the  pipe,  the  volumetric  flow  rate  (i.e.,  volume  per  second  that  flows  through  the  pipe)  Q, 
and  the  kinematic  viscosity  v  =  77/p. 

(e)  For  low  Reynolds  number,  fluids  will  typically  exhibit  so-called  laminar  flow,  in  which  the  fluid  par¬ 
ticles  all  follow  paths  that  nicely  align  (this  is  the  transparent  flow  of  water  from  a  tap  at  low  flux). 
For  higher  Reynolds  number,  the  flow  becomes  turbulent,  with  many  eddies  and  vortices  (the  white¬ 
looking  flow  of  water  from  the  tap  you  observe  when  increasing  the  flow  rate).  The  maximum  Rey¬ 
nolds  number  for  which  the  flow  is  typically  laminar  is  experimentally  measured  to  be  about  2300. 
Estimate  the  flow  velocity  and  volumetric  flow  rate  of  water  from  a  tap  with  a  1.0  cm  diameter  in  the 
case  that  the  flow  is  just  laminar. 
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1.4  The  escape  velocity  of  a  planet  is  defined  as  the  minimal  initial  velocity  an  object  must  have  to  escape 
its  gravitational  pull  completely  (and  thus  go  fast  enough  to  defy  the  rule  that  ‘what  goes  up  must  come 
down’). 

(a)  From  Newton’s  universal  law  of  gravitation  (equation  2.9),  determine  the  dimension  of  the  gravita¬ 
tional  constant  G. 

(b)  Use  dimensional  analysis  to  show  that  for  a  planet  of  mass  M  and  radius  R,  the  escape  velocity  scales 
as  v  ~  VMGIR. 

(c)  A  more  detailed  calculation  shows  that  in  fact  we  have  Escape  =  V2GM/R.  Express  this  value  of  the 
escape  velocity  in  terms  of  the  (mass)  density  p  of  the  planet,  instead  of  its  mass  M. 

(d)  The  average  density  of  the  moon  is  about  6/10th  that  of  the  Earth,  and  the  Moon’s  radius  is  about 
11/40  times  that  of  the  Earth.  From  these  numbers  and  your  answer  at  (c),  calculate  the  ratio  of  the 
escape  velocities  of  the  Moon  and  the  Earth,  and  explain  why  the  Apollo  astronauts  needed  a  huge 
rocket  to  get  to  the  Moon,  and  only  a  tiny  one  to  get  back. 


Forces 


2.1.  wton’s  laws  of  motion 

As  described  in  chapter  ,  classical  mechanics  is  based  on  a  set  of  axioms,  which  in  turn  are  based  on  (re¬ 
peated)  physical  observations.  In  order  to  formulate  the  first  three  axioms,  we  will  need  to  first  define  three 
quantities:  the  (instantaneous)  velocity,  acceleration  and  momentum  of  a  particle.  If  we  denote  the  position 
of  a  particle  as  x(t)  -  indicating  a  vector  quantity  with  the  dimension  of  length  that  depends  on  time,  we 
define  its  velocity  as  the  time  derivative  of  the  position: 


v(t)  =  jfc(f)  = 


dx(r) 

df 


(2.1) 


Note  that  we  use  an  overdot  to  indicate  a  time  derivative,  we  will  use  this  convention  throughout  these  notes. 
The  acceleration  is  the  time  derivative  of  the  velocity,  and  thus  the  second  derivative  of  the  position: 


a{t)  —  x(f)  = 


dv{t) 

df 


d2x(f) 

df2 


(2.2) 


Finally  the  momentum  of  a  particle  is  its  mass  times  its  velocity: 


pit)  =  mv[t)  -  mx[t).  (2.3) 

We  are  now  ready  to  give  our  next  three  axioms.  You  may  have  encountered  them  before;  they  are  known  as 
Newton’s  three  laws  of  motion. 

Axiom  1  (Newton’s  first  law  of  motion).  As  long  as  there  is  no  external  action,  a  particle’s  velocity  will  remain 
constant. 

Note  that  the  first  law  includes  particles  at  rest,  i.e.,  v  -  0.  We  will  define  the  general  ‘external  action’  as  a 
force,  therefore  a  force  is  now  anything  that  can  change  the  velocity  of  a  particle.  The  second  law  quantifies 
the  force. 


Axiom  2  (Newton’s  second  law  of  motion).  If  there  is  a  net  force  acting  on  a  particle,  then  its  instantaneous 
change  in  momentum  due  to  that  force  is  equal  to  that  force: 


F[t)  = 


dp  it) 
df 


(2.4) 


Now  since  p-mv  and  a  —  dvl  df,  if  the  mass  is  constant  we  can  also  write  (  )  as  F  —  ma,  or 


Fit)  -  mxit), 


(2.5) 


which  is  the  form  in  we  will  use  most.  Based  on  the  second  law,  we  see  that  a  force  has  the  physical  dimen¬ 
sion  of  a  mass  times  a  length  divided  by  a  time  squared  -  since  this  is  quite  a  lot  to  put  in  every  time,  we 

Appendix  A.  lists  some  basic  properties  of  vectors  that  you  may  find  useful. 
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Isaac  Newton  (1642-1727)  was  a  British  physicist,  astronomer  and  mathe¬ 
matician,  who  is  widely  regarded  as  one  of  the  most  important  scientists  in 
history.  Newton  was  a  professor  at  Cambridge  from  1667  till  1702,  where 
he  held  the  famous  Lucasian  chair  in  mathematics.  Newton  invented  in¬ 
finitesimal  calculus  to  be  able  to  express  the  laws  of  mechanics  that  now 
bear  his  name  (section  )  in  mathematical  form.  He  also  gave  a  mathe¬ 
matical  description  of  gravity  (equation  .9),  from  which  he  could  derive 
Kepler’s  laws  of  planetary  motion  (section  ).  In  addition  to  his  work 
on  mechanics,  Newton  made  key  contributions  to  optics  and  invented  the 
reflection  telescope,  which  uses  a  mirror  rather  than  a  lens  to  gather  light. 
Having  retired  from  his  position  in  Cambridge,  Newton  spend  most  of  the 
second  half  of  his  life  in  London,  as  warden  and  later  master  of  the  Royal 
mint,  and  president  of  the  Royal  society. 


Figure  2.1:  Portrait  of  Isaac  New¬ 
ton  by  Godfrey  Kneller  (1689)  [2] . 


define  the  dimension  of  force  as  such:  F  =  MLT~2.  Likewise,  we  define  a  unit,  the  Newton  (N),  as  a  kilogram 
times  a  meter  per  second  squared:  N  =  kg  •  m/s2.  Therefore,  in  principle  Newton’s  second  law  of  motion  can 
also  be  used  to  measure  forces,  though  we  will  often  use  it  the  other  way  around,  and  calculate  changes  in 
momentum  due  to  a  known  force. 

Note  how  Newton’s  first  law  follows  from  the  second:  if  the  force  is  zero,  there  is  no  change  in  momentum, 
and  thus  (assuming  constant  mass)  a  constant  velocity.  Note  also  that  although  the  second  law  gives  us  a 
quantification  of  the  force,  by  itself  it  will  not  help  us  achieve  much,  as  we  at  present  have  no  idea  what  the 
force  is  (though  you  probably  have  some  intuitive  ideas  from  experience)  -  for  that  we  will  use  the  force  laws 
of  the  next  section.  Before  we  go  there,  there  is  another  important  observation  on  the  nature  of  forces  in 
general. 

Axiom  3  (Newton’s  third  law  of  motion).  If  a  body  exerts  a  force  F\  on  a  second  body,  the  second  body  exerts  an 
equal  but  opposite  force  F2,  on  the  first,  i.e.,  the  forces  are  equal  in  magnitude  but  opposite  in  direction: 

Fi  =  -F2.  (2.6) 


2.2.  Force  laws 

Newton’s  second  law  of  motion  tells  us  what  a  force  does :  it  causes  a  change  in  momentum  of  any  particle  it 
acts  upon.  It  does  not  tell  us  where  the  force  comes  from,  nor  does  it  care  -  which  is  a  very  useful  feature,  as 
it  means  that  the  law  applies  to  all  forces.  However,  we  do  of  course  need  to  know  what  to  put  down  for  the 
force,  so  we  need  some  rule  to  determine  it  independently.  This  is  where  the  force  laws  come  in. 

2.2.1.  Springs:  Hooke’s  law 

One  very  familiar  example  of  a  force  is  the  spring  force:  you  need  to  exert  a  force  on  something  to  compress 
it,  and  (in  accordance  with  Newton’s  third  law) ,  if  you  press  on  something  you’ll  feel  it  push  back  on  you.  The 
simplest  possible  object  that  you  can  compress  is  an  ideal  spring,  for  which  the  force  necessary  to  compress 
it  scales  linearly  with  the  compression  itself.  This  relation  is  known  as  Hooke’s  law: 

Fs  =  -/cx,  (2.7) 

where  x  is  now  the  displacement  (from  rest)  and  k  is  the  spring  constant,  measured  in  newtons  per  meter. 
The  value  of  k  depends  on  the  spring  in  question  -  stiffer  springs  having  higher  spring  constants. 

Hooke’s  law  gives  us  another  way  to  measure  forces.  We  have  already  defined  the  unit  of  force  using 
Newton’s  second  law  of  motion,  and  we  can  use  that  to  calibrate  a  spring,  i.e.,  determine  its  spring  constant, 
by  determining  the  displacement  due  to  a  known  force.  Once  we  have  k,  we  can  simply  measure  forces  by 
measuring  displacements  -  this  is  exactly  what  a  spring  scale  does. 


2.2.  Force  laws 
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Robert  Hooke  (1635-1703)  was  a  British  all-round  natural  scientist  and  ar¬ 
chitect.  He  discovered  the  force  law  named  after  him  in  1660,  which  he 
published  first  as  an  anagram:  ‘ceiiinosssttuv’,  so  he  could  claim  the  dis¬ 
covery  without  actually  revealing  it  (a  fairly  common  practice  at  the  time) ; 
he  only  provided  the  solution  in  1678:  ‘ut  tensio,  sic  vis’  (‘as  the  exten¬ 
sion,  so  the  force’).  Hooke  made  many  contributions  to  the  development 
of  microscopes,  using  them  to  reveal  the  structure  of  plants,  coining  the 
word  cell  for  their  basic  units.  Hooke  was  the  curator  of  experiments  of 
England’s  Royal  Society  for  over  40  years,  combining  this  position  with  a 
professorship  in  geometry  and  the  job  of  surveyor  of  the  city  of  London 
after  the  great  fire  of  1666.  In  the  latter  position  he  got  a  strong  reputation 
for  a  hard  work  and  great  honesty.  At  the  same  time,  he  was  frequently  at 
odds  with  his  contemporaries  Isaac  Newton  and  Christiaan  Huygens;  it  is 
not  unlikely  that  they  independently  developed  similar  notions  on,  among 
others  on  the  inverse-square  law  of  gravity. 


Figure  2.2:  Drawing  of  the  cell 
structure  of  cork  by  Hooke,  from 
his  1665  book  Micrographia  [3]. 
No  portraits  of  Hooke  survive. 


2.2.2.  Gravity:  Newton’s  law  of  gravity 

A  second  and  probably  even  more  familiar  example  is  force  due  to  gravity,  at  the  local  scale,  i.e.,  around  you, 
in  the  approximation  that  the  Earth  is  flat.  Anything  that  has  mass  attracts  everything  else  that  has  mass, 
and  since  the  Earth  is  very  massive,  it  attracts  all  objects  in  the  space  around  you,  including  yourself.  Since 
the  force  of  gravity  is  weak,  you  won’t  feel  the  pull  of  your  book,  but  since  the  Earth  is  so  massive,  you  do 
feel  its  pull.  Therefore  if  you  let  go  of  something,  it  will  be  accelerated  towards  the  Earth  due  to  its  attracting 
gravitational  force.  As  demonstrated  by  Galilei  (and  some  guys  in  spacesuits  on  a  rock  we  call  the  moon  ), 
the  acceleration  of  any  object  due  to  the  force  of  gravity  is  the  same,  and  thus  the  force  exerted  by  the  Earth 
on  any  object  equals  the  mass  of  that  object  times  this  acceleration,  which  we  call  g: 


Fg  =  mg. 


(2.8) 


Because  the  Earth’s  mass  is  not  exactly  uniformly  distributed,  the  magnitude  of  g  varies  slightly  from  place  to 
place,  but  to  good  approximation  equals  9.81  m/s2.  It  always  points  down. 

Although  equation  (2.8)  for  local  gravity  is  handy,  its  range  of  application  is  limited  to  everyday  objects  at 
everyday  altitudes  -  say  up  to  a  couple  thousand  kilograms  and  a  couple  kilometers  above  the  surface  of  the 
Earth,  which  is  tiny  compared  to  Earth’s  mass  and  radius.  For  larger  distances  and  bodies  with  larger  mass 
-  say  the  Earth-Moon,  or  Earth-Sun  systems  -  we  need  something  else,  namely  Newton’s  law  of  gravitation 
between  two  bodies  with  masses  m\  and  m2  and  a  distance  r  apart: 


Fg  =  ~G 


m  1  m2  „ 

- 2~r’ 

r2 


(2.9) 


where  r  is  the  unit  vector  pointing  along  the  line  connecting  the  two  masses,  and  the  proportionality  constant 
G  =  6.67- 10-11  N  m2/kg2  is  known  as  the  gravitational  constant  (or  Newton’s  constant).  The  minus  sign 
indicates  that  the  force  is  attractive.  Equation  (2.9)  allows  you  to  actually  calculate  the  gravitational  pull  that 
your  book  exerts  on  you,  and  understand  why  you  don’t  feel  it.  It  also  lets  you  calculate  the  value  of  g  -  simply 
fill  in  the  mass  and  radius  of  the  Earth.  If  you  wish  to  know  the  value  of  g  on  any  other  celestial  body,  you 
can  put  in  its  particulars,  and  compare  with  Earth.  You’ll  find  you’d  ‘weigh’  3  times  less  on  Mars  and  6  times 
less  on  the  Moon.  Most  of  the  time  we  can  safely  assume  the  Earth  is  flat  and  use  (2.8),  but  in  particular  for 
celestial  mechanics  and  when  considering  satellites  we’ll  need  to  use  (2.9). 


2To  be  precise,  astronaut  David  Scott  of  the  Apollo  15  mission  in  1971 ,  who  dropped  both  a  hammer  and  a  feather  and  saw  them  fall  at 
exactly  the  same  rate,  as  shown  in  this  NASA  movie. 
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2.  Forces 


Galileo  Galilei  (1564-1642)  was  an  Italian  physicist  and  astronomer,  who 
is  widely  regarded  as  one  of  the  founding  figures  of  modern  science.  Un¬ 
like  classical  philosophers,  Galilei  championed  the  use  of  experiments  and 
observations  to  validate  (or  disprove)  scientific  theories,  a  practice  that  is 
the  cornerstone  of  the  scientific  method.  He  pioneered  the  use  of  the  tele¬ 
scope  (newly  invented  at  the  time)  for  astronomical  observations,  leading 
to  his  discovery  of  mountains  on  the  moon  an  the  four  largest  moons  of 
Jupiter  (now  known  as  the  Galilean  moons  in  his  honor).  On  the  theo¬ 
retical  side,  Galilei  argued  that  Aristotle’s  argument  that  heavy  objects  fall 
faster  than  light  ones  is  incorrect,  and  that  the  acceleration  due  to  grav¬ 
ity  is  equal  for  all  objects  (equation  2.8).  Galilei  also  strongly  advocated 
the  heliocentric  worldview  introduced  by  Copernicus  in  1543,  as  opposed 
to  the  widely-held  geocentric  view.  Unfortunately,  the  Inquisition  thought 
otherwise,  leading  to  his  conviction  for  heresy  with  a  sentence  of  life-long 
house  arrest  in  1633,  a  position  that  was  only  recanted  by  the  church  in 
1995. 


Figure  2.3:  Portrait  of  Galileo 
Galilei  by  Justus  Sustermans 
(1636)  [4], 


2.2.3.  Electrostatics:  Coulomb’s  law 


Like  two  masses  interact  due  to  the  gravitational  force,  two  charged  objects  interact  via  Coulomb’s  force. 
Because  charge  has  two  possible  signs,  Coulomb’s  force  can  both  be  attractive  (between  opposite  charges) 
and  repulsive  (between  identical  charges).  Its  mathematical  form  strongly  resembles  that  of  Newton’s  law  of 
gravity: 


-Fc  = 


ke 


r2  r 


(2.10) 


where  q\  and  £72  are  the  signed  magnitudes  of  the  charges,  r  is  again  the  distance  between  them,  and  ke  - 
8.99  •  109  N  •  m2/C2  is  Coulomb’s  constant.  For  everyday  length  and  force  scales,  Coulomb’s  force  is  much 
larger  than  the  force  of  gravity. 


2.2.4. 

Why  did  it  take  the  genius  of  Galilei  and  Newton  to  uncover  Newton’s  first  law  of  motion?  Because  everyday 
experience  seems  to  contradict  it:  if  you  don’t  exert  a  force,  you  won’t  keep  moving,  but  gradually  slow  down. 
You  know  of  course  why  this  is:  there’s  drag  and  friction  acting  on  a  moving  body,  which  is  why  it’s  much 
easier  (though  not  necessarily  handier)  for  a  car  to  keep  moving  on  ice  than  on  a  regular  tarmac  road  (less 
friction  on  ice),  and  why  walking  through  water  is  so  much  harder  than  walking  through  air  (more  drag  in 
water).  The  medium  in  which  you  move  can  exert  a  drag  force  on  you,  and  the  surface  over  which  you  move 
exerts  friction  forces.  These  of  course  are  the  forces  responsible  for  slowing  you  down  when  you  stop  exerting 
force  yourself,  so  the  first  law  doesn’t  apply,  as  there  are  forces  acting. 

For  low  speeds,  the  drag  force  typically  scales  linearly  with  the  velocity  of  the  moving  object.  Drag  forces 
for  objects  moving  through  a  (fluid)  medium  moreover  depend  on  the  properties  of  the  medium  (its  viscosity 
77)  and  the  cross-sectional  area  of  the  moving  object.  For  a  sphere  of  radius  R  moving  at  velocity  v,  the  drag 
force  is  given  by  Stokes’  law: 

Fd  =  —6mjRv.  (2.11) 

The  more  general  version  for  an  object  of  arbitrary  shape  is  F^  =  (v,  where  (  is  a  proportionality  constant. 
Stokes’  law  breaks  down  at  high  velocities,  for  which  the  drag  force  scales  quadratically  with  the  speed: 

Fd  =  ~Pcd^y2,  (2.12) 

where  p  is  the  density  of  the  fluid,  A  the  cross-sectional  area  of  the  object,  v  it’s  speed,  and  c,j  it’s  dimension¬ 
less  drag  coefficient,  which  depends  on  the  object’s  shape  and  surface  properties.  Typical  values  for  the  drag 
coefficient  are  1.0  for  a  cyclist,  1.2  for  a  running  person,  0.48  for  a  Volkswagen  Beetle,  and  0.19  for  a  modern 
aerodynamic  car.  The  direction  of  the  drag  force  is  still  opposite  that  of  the  motion. 

Frictional  forces  are  due  to  two  surfaces  sliding  past  each  other.  It  should  come  as  no  surprise  that  the 
direction  of  the  frictional  force  is  opposite  that  of  the  motion,  and  its  magnitude  depends  on  the  properties  of 
the  surfaces.  Moreover,  the  magnitude  of  the  frictional  force  also  depends  on  how  strongly  the  two  surfaces 


2.3.  Equations  of  motion 
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Charles-Augustin  de  Coulomb  (1736-1806)  was  a  French  physicist  and 
military  engineer.  For  most  of  his  working  life,  Coulomb  served  in  the 
French  army,  for  which  he  supervised  many  construction  projects.  As  part 
of  this  job,  Coulomb  did  research,  first  in  mechanics  (leading  to  his  law 
of  kinetic  friction,  equation  2  ),  and  later  in  electricity  and  magnetism, 

for  which  he  discovered  that  the  force  between  charges  (and  those  be¬ 
tween  magnetic  poles)  drops  off  quadratically  with  their  distance  (equa¬ 
tion  L0).  Near  the  end  of  his  life,  Coulomb  participated  in  setting  up  the 
SI  system  of  units. 


Figure  2.4:  Portrait  of  Charles  de 
Coulomb  [5]. 


are  pushed  against  each  other  -  i.e.,  on  the  forces  they  exert  on  each  other,  perpendicular  to  the  surface.  These 
forces  are  of  course  equal  (by  Newton’s  third  law)  and  are  called  normal  forces,  because  they  are  normal  (that 
is,  perpendicular)  to  the  surface.  If  you  stand  on  a  box,  gravity  exerts  a  force  on  you  pulling  you  down,  which 
you  ‘transfer’  to  a  force  you  exert  on  the  top  of  the  box,  and  causes  an  equal  but  opposite  normal  force  exerted 
by  the  top  of  the  box  on  your  feet.  If  the  box  is  tilted,  the  normal  force  is  still  perpendicular  to  the  surface  (it 
remains  normal),  but  is  no  longer  equal  in  magnitude  to  the  force  exerted  on  you  by  gravity.  Instead,  it  will 
be  equal  to  the  component  of  the  gravitational  force  along  the  direction  perpendicular  to  the  surface  (see 
figure  2.6).  We  denote  normal  forces  as  Fn.  Now  according  to  the  Coulomb  friction  law  (not  to  be  confused 
with  the  Coulomb  force  between  two  charged  particles),  the  magnitude  of  the  frictional  force  between  two 
surfaces  satisfies 

Ff  <  pF„.  (2.13) 

Here  /i  is  the  coefficient  of  friction,  which  of  course  depends  on  the  two  surfaces,  but  also  on  the  question 
whether  the  two  surfaces  are  moving  with  respect  to  each  other  or  not.  If  they  are  not  moving,  i.e.,  the  con¬ 
figuration  is  static,  the  appropriate  coefficient  is  called  the  coefficient  of  static  friction  and  denoted  by  /is. 
The  actual  magnitude  of  the  friction  force  will  be  such  that  it  balances  the  other  forces  (more  on  that  in  sec¬ 
tion  2.‘ ).  Equation  (2.13)  tells  us  that  this  is  only  possible  if  the  required  magnitude  of  the  friction  force  is 
less  than  qsFn.  When  things  start  moving,  the  static  friction  coefficient  is  replaced  by  the  coefficient  of  kinetic 
friction  p^,  which  is  usually  smaller  than  ps;  also  in  that  case  the  inequality  in  equation  (2.13)  gets  replaced 
by  an  equals  sign,  and  we  have  Ff  = 

2.3.  Equations  of  motion 

Now  that  we  have  set  our  axioms  -  Newton’s  laws  of  motion  and  the  various  force  laws  -  we  are  ready  to 
start  combining  them  to  get  useful  results,  things  that  we  did  not  put  into  the  axioms  in  the  first  place  but 
follow  from  them.  The  first  thing  we  can  do  is  write  down  equations  of  motion:  an  equation  that  describes 
the  motion  of  a  particle  due  to  the  action  of  a  certain  type  of  force.  For  example,  suppose  you  take  a  rock  of 
a  certain  mass  m  and  let  go  of  it  at  some  height  h  above  the  ground,  then  what  will  happen?  Once  you’ve  let 
go  of  the  rock,  there  is  only  one  force  acting  on  the  rock,  namely  Earth’s  gravity,  and  we  are  well  within  the 
regime  where  equation  (2.8)  applies,  so  we  know  the  force.  We  also  know  that  this  net  force  will  result  in  a 
change  of  momentum  (equation  2.  ),  which,  because  the  rock  won’t  loose  any  mass  in  the  process  of  falling, 
can  be  rewritten  as  (  ) .  By  equating  the  forces  we  arrive  at  an  equation  of  motion  for  the  rock,  which  in  this 
case  is  very  simple: 

mg -mix.  (2.14) 

We  immediately  see  that  the  mass  of  the  rock  doesn’t  matter  (Galilei  was  right!  -  though  of  course  he  was  in 
our  set  of  axioms,  because  we  arrived  at  them  by  assuming  he  was  right...).  Less  trivially,  equation  (  )  is 

a  second-order  differential  equation  for  the  motion  of  the  rock,  which  means  that  in  order  to  find  the  actual 
motion,  we  need  two  initial  conditions  -  which  in  our  present  example  are  that  the  rock  starts  at  height  h  and 
zero  velocity. 
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Figure  2.5:  Dropping  under  the  force  of  gravity,  (a  and  b)  A  ball  released  from  rest  drops  with  a  constant  acceleration,  resulting  in  a 
constantly  increasing  velocity.  Images  in  (a)  are  taken  every  0.05  s;  distances  are  multiples  of  12  mm.  In  (b),  the  trajectory  of  the  ball 
resulting  from  repeated  bounces  is  shown  with  intervals  of  0.04  s  [(  ],  CC  BY-SA  3.0.  (c)  Paragliders  need  to  balance  the  force  of  gravity 
and  that  of  drag  to  stop  accelerating  and  fall  at  a  continuous  speed  (known  as  their  terminal  velocity)  [  ] ,  CC  BY-SA  3.0. 


Equation  (  )  is  essentially  one-dimensional  -  all  motion  occurs  along  the  vertical  line.  Solving  it  is 

therefore  straightforward  -  you  simply  integrate  over  time  twice.  The  general  solution  is: 

x{t)  =  x{Q)  +  v{P)t+-gt2,  (2.15) 

which  with  our  boundary  conditions  becomes 

x(f)  =  (h-igf2)z,  (2.16) 

where  g  is  the  magnitude  of  g  (which  points  down,  hence  the  minus  sign).  Of  course  equation  (2.16)  breaks 
down  when  the  rock  hits  the  ground  at  t-  y/Thlg,  which  is  easily  understood  because  at  that  point  gravity  is 
no  longer  the  only  force  acting  on  it. 

We  can  also  immediately  write  down  the  equation  of  motion  for  a  mass  on  a  spring  (no  gravity  at  present), 
in  which  the  net  force  is  given  by  Hooke’s  law.  Equating  that  force  to  the  net  force  in  Newton’s  second  law  of 
motion  gives: 

-  kx{t)  -  mx{t) .  (2.17) 


Of  course,  we  find  another  second-order  differential  equation,  so  we  again  need  the  initial  position  and  veloc¬ 
ity  to  specify  a  solution.  The  general  solution  of  (2.1  )  is  a  combination  of  sines  and  cosines,  with  a  frequency 
oj-Vlcl  m  (as  we  already  know  from  the  dimensional  analysis  in  section  ,2): 


i/(0) 

x[t)  -  x(0)  cos(wf)  H - sin(wr). 

0) 


(2.18) 


We’ll  study  this  case  in  more  detail  in  section  8.1. 

In  general,  the  force  in  Newton’s  second  law  may  depend  on  time  and  position,  as  well  as  on  the  first 
derivative  of  the  position,  i.e.,  the  velocity.  For  the  special  case  that  it  depends  on  only  one  of  the  three  vari¬ 
ables,  we  can  write  down  the  solution  formally,  in  terms  of  an  integral  over  the  force.  These  formal  solutions 
are  given  in  section  2.6.  To  see  how  they  work  in  practice,  let’s  consider  a  slightly  more  involved  problem,  that 
of  a  stone  falling  with  drag. 


2.4.  Multiple  forces 
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2.3.1.  Worked  example:  falling  stone  with  drag 

Suppose  we  have  a  spherical  stone  of  radius  a  that  you  drop  from  a  height  halt-  0.  At  what  time,  and  with 
which  velocity,  will  the  stone  hit  the  ground?  We  already  solved  this  problem  in  the  simple  case  without  drag 
above,  but  now  let’s  include  drag.  There  are  then  two  forces  acting  on  the  stone:  gravity  (pointing  down)  with 
magnitude  mg,  and  drag  (pointing  in  the  direction  opposite  the  motion,  in  this  case  up)  with  magnitude 
Gmjav  =  bv,  as  given  by  Stokes’  law  (equation  2.1  ).  Our  equation  of  motion  is  now  given  by  (with  x  as  the 
height  of  the  particle,  and  the  downward  direction  as  positive): 

mx  =  -bx+mg.  (2.19) 


We  see  that  our  force  does  not  depend  on  time  or  position,  but  only  on  velocity  -  so  we  have  case  3  of  ap¬ 
pendix  2.6.  We  could  invoke  either  equation  (  )  or  (22  )  to  write  down  a  formal  solution,  but  there  is  an 

easier  way,  which  will  allow  us  to  evaluate  the  relevant  integrals  without  difficulty.  Since  our  equation  of  mo¬ 
tion  is  linear,  we  know  that  the  sum  of  two  solutions  is  again  a  solution.  One  of  the  terms  on  the  right  hand 
side  of  equation  (  )  is  constant,  which  means  that  our  equation  is  not  homogeneous  (we  can  rewrite  it  to 

mix.  +  bx  -  mg  to  see  this),  so  a  useful  thing  to  do  is  to  split  our  solution  in  a  homogeneous  and  a  particular 
part.  Rewriting  our  equation  in  terms  of  v  =  x  instead  of  x,  we  get  mv  +  bv-  mg,  from  which  we  can  immedi¬ 
ately  get  a  particular  solution:  vp  =  mg  lb,  as  the  time  derivative  of  this  constant  vp  vanishes.  Subtracting  vp, 
we  are  left  with  a  homogeneous  equation:  mv^+bv^,  which  we  now  solve  by  separation  of  variables.  First  we 
write  (>|,  =  du\J  d t,  then  re-arrange  so  that  all  factors  containing  U],  are  on  one  side  and  all  factors  containing 
t  are  on  the  other,  which  gives  -(mlb)(ll  P|,)  d  u\t  =  df.  We  can  now  integrate  to  get: 


t-  t0, 


(2.20) 


which  is  an  example  of  equation  (2.33).  After  rearranging  and  setting  to  —  0: 


fhU)  =  foexp 


(2.21) 


Note  that  this  homogeneous  solution  fits  our  intuition:  if  there  is  no  extra  force  on  the  particle,  the  drag  force 
will  slow  it  down  exponentially.  Also  note  that  we  didn’t  set  uq  =  0,  as  the  homogeneous  solution  does  not 
equal  the  total  solution.  Instead  Vq  is  an  integration  constant  that  we’ll  need  to  set  once  we’ve  written  down 
the  full  solution,  which  is: 


v{t)  =  vh(t)  +  vp (t)  =  Vo  exp 


m  )  b 


(2.22) 


Now  setting  f(0)  =  0  gives  vq  =  - mglb ,  so 


v[t)  = 


mg 


1  -  exp 


(2.23) 


To  get  x(f),  we  simply  integrate  v{  t)  over  time,  to  get: 


(2.24) 


We  can  find  when  the  stone  hits  the  ground  by  setting  x(f)  =  h  and  solving  for  f;  we  can  find  how  fast  it  is 
going  at  that  point  by  substituting  that  value  of  t  back  into  v(t). 


2.4.  Multiple  forces 

In  the  examples  in  section  there  was  only  a  single  force  acting  on  the  particle  of  interest.  Usually  there 
will  be  multiple  forces  acting  at  the  same  time,  not  necessarily  pulling  in  the  same  direction.  This  is  where 
vectors  come  into  play. 

Suppose  you  put  a  book  on  a  table.  The  Earth’s  gravity  pulls  it  down  with  a  force  of  magnitude  Fg.  Conse¬ 
quently  the  book  exerts  a  normal  force  down  on  the  table  with  the  same  magnitude,  and  the  table  reciprocates 
with  an  identical  but  oppositely  directed  normal  force  of  magnitude  Fn  =  Fg.  Now  suppose  you  push  against 
the  book  from  the  side  with  a  force  of  magnitude  F.  As  we’ve  seen  in  section  2.2,  there  will  then  be  a  fric¬ 
tion  force  between  the  book  and  the  table  in  the  opposite  direction,  which,  as  long  as  it  doesn’t  exceed  nsFn, 
equals  the  force  you  push  with.  However,  once  F  is  larger  than  iisFn,  there  will  be  a  net  force  acting  on  the 
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Figure  2.6:  Free  body  diagram  of  the  forces  acting  on  a  book  on  a  slanted  table.  Gravity  always  points  down,  normal  forces  always 
perpendicular  to  the  surface,  and  frictional  forces  always  parallel  to  the  surface.  The  force  of  gravity  can  be  decomposed  in  directions 
perpendicular  and  parallel  to  the  surface  as  well. 


book.  It  is  the  net  force  that  we  substitute  into  Newton’s  second  law,  and  from  which  the  book  will  get  a  net 
acceleration. 

In  the  situation  described  above,  things  are  still  simple  -  you  get  the  net  force  by  subtracting  the  kinetic 
friction  Ff  -  PfFn  from  the  force  F  you  exert  on  the  book,  because  these  are  horizontal  and  thus  perpen¬ 
dicular  to  the  vertical  normal  and  gravitational  forces.  But  what  happens  if  you  lift  the  table  on  one  end,  so 
that  it  becomes  slanted?  To  help  organize  our  thoughts,  we’ll  draw  a  free  body  diagram,  shown  in  figure  .6. 
Gravity  still  acts  downward,  and  the  mass  of  the  book  stays  the  same,  so  Fg  doesn’t  change.  However,  the 
orientation  of  the  contact  plane  between  the  book  and  table  does  change,  so  the  normal  force  (remember, 
normal  to  the  surface)  changes  as  well.  It’s  direction  will  remain  perpendicular  to  the  surface,  and  as  long 
as  you  don’t  push  on  the  book  (or  push  along  the  surface  only),  the  only  other  force  having  a  component 
perpendicular  to  the  surface  is  gravity,  so  the  magnitude  of  the  normal  force  better  be  equal  to  that  (or  the 
book  would  either  spontaneously  start  to  float,  or  fall  through  the  table).  You  can  find  this  component  by 
decomposing  the  gravitational  force  along  the  directions  perpendicular  and  parallel  to  the  slanted  surface. 
The  remaining  component  of  the  gravitational  force  points  downward  along  the  surface  of  the  table,  and  is 
comparable  to  the  force  you  were  exerting  on  the  book  in  the  flat  case.  Up  to  some  point  it  is  balanced  by  a 
static  frictional  force,  but  once  it  gets  too  large  (because  the  slant  angle  of  the  table  gets  too  large),  friction 
reaches  its  maximum  and  gravity  results  in  a  net  force  on  the  book,  which  will  start  to  slide  down  (as  you  no 
doubt  guessed  already) . 


2.5.  Statics 

When  multiple  forces  act  on  a  body,  the  (vector)  sum  of  those  forces  gives  the  net  force,  which  is  the  force  we 
substitute  in  Newton’s  second  law  of  motion  to  get  the  equation  of  motion  of  the  body.  If  all  forces  sum  up  to 
zero,  there  will  be  no  acceleration,  and  the  body  retains  whatever  velocity  it  had  before.  Statics  is  the  study  of 
objects  that  are  neither  currently  moving  nor  experiencing  a  net  force,  and  thus  remain  stationary.  You  might 
expect  that  this  study  is  easier  than  the  dynamical  case  when  bodies  do  experience  a  net  force,  but  that  just 
depends  on  context.  Imagine,  for  example,  ajar  filled  with  marbles:  they  aren’t  moving,  but  the  forces  acting 
on  the  marbles  are  certainly  not  zero,  and  also  not  uniformly  distributed. 

Even  if  there  is  no  net  force,  there  is  no  guarantee  that  an  object  will  exhibit  no  motion:  if  the  forces  are 
distributed  unevenly  along  an  extended  object,  it  may  start  to  rotate.  Rotations  always  happen  around  a 
stationary  point,  known  as  the  pivot.  Only  a  force  that  has  a  component  perpendicular  to  the  line  connecting 
its  point  of  action  to  the  pivot  (the  arm)  can  make  an  object  rotate.  The  corresponding  angular  acceleration 
due  to  the  force  depends  on  both  the  magnitude  of  that  perpendicular  component  and  the  length  of  the  arm, 
and  is  known  as  the  moment  of  the  force  or  the  torque  tau.  The  magnitude  of  the  torque  is  therefore  given  by 
Fr  sin0,  where  F  is  the  magnitude  of  the  force,  r  the  length  of  the  arm,  and  6  the  angle  between  the  force  and 
the  arm.  If  we  write  the  arm  as  a  vector  r  pointing  from  the  pivot  to  the  point  where  the  force  acts,  we  find 
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that  the  magnitude  of  the  torque  equals  the  cross  product  of  r  and  F: 

r  =  r  x  F.  (2.25) 

The  direction  of  rotation  can  be  found  by  the  right-hand  rule  from  the  direction  of  the  torque:  if  the  thumb 
of  your  right  hand  points  along  the  direction  of  r,  then  the  direction  in  which  your  fingers  curve  will  be  the 
direction  in  which  the  object  rotates  due  to  the  action  of  the  corresponding  force  F. 

We  will  study  rotations  in  detail  in  chapter  .  For  now,  we’re  interested  in  the  case  that  there  is  no  motion, 
neither  linear  nor  rotational,  which  means  that  the  forces  and  torques  acting  on  our  object  must  satisfy  the 
stability  condition,  for  an  extended  object  to  be  stationary,  both  the  sum  of  the  forces  and  the  sum  of  the 
torques  acting  on  it  must  be  zero. 

2.5.1.  Worked  example:  Suspended  sign 


Figure  2.7:  A  suspended  sign  (example  of  a  calculation  in  statics),  (a)  Problem  setting,  (b)  Free-body  diagram. 

A  sign  of  mass  M  hangs  suspended  from  a  rod  of  mass  m  and  length  L  in  a  symmetric  way  and  such 
that  the  centers  of  mass  of  the  sign  and  rod  nicely  align  (figure  2.  a).  One  end  of  the  rod  is  anchored  to  a 
wall  directly,  while  the  other  is  supported  by  a  wire  with  negligible  mass  that  is  attached  to  the  same  wall  a 
distance  li  above  the  anchor,  (a)  If  the  maximum  tension  the  wire  can  support  is  T,  find  the  minimum  value 
of  h.  (b)  For  the  case  that  the  tension  in  the  wire  equals  the  maximum  tension,  find  the  force  (magnitude  and 
direction)  exerted  by  the  anchor  on  the  rod. 

Solution 

(a)  We  first  draw  a  free-body  diagram,  figure  2.7b.  Force  balance  on  the  sign  tells  us  that  the  tensions  in 
the  two  lower  wires  sum  to  the  gravitational  force  on  the  sign.  The  rod  is  stationary,  so  we  know  that 
the  sum  of  the  torques  on  it  must  vanish.  To  get  torques,  we  first  need  a  pivot;  we  pick  the  point  where 
the  rod  is  anchored  to  the  wall.  We  then  have  three  forces  contributing  a  clockwise  torque,  and  one 
contributing  a  counterclockwise  torque.  We’re  not  told  exactly  where  the  wires  are  attached  to  the  rod, 
but  we  are  told  that  the  configuration  is  symmetric  and  that  the  center  of  mass  of  the  sign  aligns  with 
that  of  the  rod.  Let  the  first  wire  be  a  distance  aL  from  the  wall,  and  the  second  a  distance  (1  -  a)L. 
The  total  (clockwise)  torque  due  to  the  gravitational  force  on  the  sign  and  rod  is  then  given  by:  tz  = 
| mgL+  \MgaL  +  \Mg(\  -  a)L-  \{m  +  M)gL.  The  counterclockwise  torque  comes  from  the  tension  in 
the  wire,  and  is  given  by  Twire  =  FjsindL  =  Fj{h/Vh2  +  L2)L.  Equating  the  two  torques  allows  us  to  solve 
for  h  as  a  function  of  Ft,  as  requested,  which  gives: 

lt2jnm*m)\hz+L2)  _  _ 

^ 2  F't  I  j 4^2  _  +  M)2g2 


We  find  the  minimum  value  of  h  by  substituting  F\  -  Tmax. 
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(b)  As  the  rod  is  stationary,  all  forces  on  it  must  cancel.  In  the  horizontal  direction,  we  have  the  horizontal 
component  of  the  tension,  7  max  cosO  to  the  left,  which  must  equal  the  horizontal  component  of  the  force 
exerted  by  the  wall,  Fw  cos  <p.  In  the  vertical  direction,  we  have  the  gravitational  force  and  the  two  forces 
from  the  wires  on  which  the  sign  hangs  in  the  downward  direction,  and  the  vertical  component  of  the 
tension  in  the  wire  in  the  upward  direction,  the  sum  of  which  must  equal  the  vertical  component  of  the 
force  exerted  by  the  wall  (which  may  point  either  up  or  down).  We  thus  have 

Fw  COS  <p —  Tmax  COs6, 

Fwsitup-  ( m  +  M)g+  rmaxsin0, 

where  tan#  -  hi L  and  h  is  given  in  the  answer  to  (a).  We  find  that 

Fw=  T?nax  +  2{m  +  M)gTmaxsmG  +  (m  +  M)2g2, 

Tmaxcos6 

taiup  = - . 

C m  +  M)g+  Tmax  smd 

Note  that  the  above  expressions  give  the  complete  answer  (magnitude  and  direction).  We  could  eliminate 
h  and  0,  but  that’d  just  be  algebra,  leading  to  more  complicated  expressions,  and  not  very  useful  in  itself. 
If  we’d  been  asked  to  calculate  the  height  or  force  for  any  specific  values  of  M,  m,  and  L,  we  could  get  the 
answers  easily  by  substituting  the  numbers  in  the  expressions  given  here. 


2.6.  Solving  the  equations  of  motion  in  three  special  cases* 

In  section  2.3  we  saw  some  examples  of  equations  of  motion  originating  from  Newton’s  second  law  of  motion. 
For  the  quite  common  case  that  the  mass  of  our  object  of  interest  is  constant,  its  trajectory  will  be  given  as 
the  solution  of  a  second-order  ordinary  differential  equation,  with  time  as  our  variable.  In  general,  the  force 
in  Newton’s  second  law  may  depend  on  time  and  position,  as  well  as  on  the  first  derivative  of  the  position, 
i.e.,  the  velocity.  In  one  dimension,  we  thus  have 

mx-F{x,x,t).  (2.26) 

Equation  (2.26)  can  be  hard  to  solve  for  complicated  functions  F.  However,  in  each  of  the  special  cases  that 
the  force  only  depends  on  one  of  the  three  variables,  we  can  write  down  a  general  solution  -  albeit  as  an 
integral  over  the  force,  which  we  may  or  may  not  be  able  to  calculate  explicitly. 


2.6.1.  Case  1:  F  =  F{t) 

If  the  force  only  depends  on  time,  we  can  solve  equation  (  )  by  direct  integration.  Using  that  v  -  x,  we 

have  mv  -  Fit),  which  we  integrate  to  find 


f 

J  to 


Fit')  dt'  -  m 


f  dv' 

Jv  0 


=  m[v{t)~  i/q], 


(2.27) 


where  at  the  initial  time  t  —  to  the  object  has  velocity  v  =  vq.  We  can  now  find  the  position  by  integrating  the 
velocity: 


x{t) 


=  f  v{t')dt'. 

J  to 


(2.28) 


2.6.2.  Case  2:  F  =  F{x) 

If  the  force  depends  only  on  the  position  in  space  (as  is  the  case  for  the  harmonic  oscillator),  we  cannot 
integrate  over  time,  as  to  do  so  we  would  already  need  to  know  x(  t).  Instead,  we  invoke  the  chain  rule  to 
rewrite  our  differential  equation  as  an  equation  in  which  the  position  is  our  variable.  We  have: 


df  dv  dx  dv 
dt  dx  dt  dx’ 


(2.29) 


di> 

mv —  =  Fix), 
dx 


and  so  our  equation  of  motion  becomes 


(2.30) 


2.6.  Solving  the  equations  of  motion  in  three  special  cases 


19 


which  we  again  solve  by  direct  integration: 


(2.31) 


To  get  x{t),  we  use  the  relation  that  d xl  df  =  v{x).  Separation  of  variables  gives  Axlv(x)  -  Iddt,  which  we 
can  integrate  to  get 

rx  1 

t-t0=l  ——Ax',  (2.32) 

Jx o  v[x') 

which  gives  us  t{x).  In  principle  we  can  invert  this  expression  to  give  us  x(t),  although  in  practice  this  may 
not  be  easy. 

2.6.3.  Case  3:  F  =  F(v) 

If  the  force  depends  only  on  the  velocity,  there  are  two  ways  we  can  proceed.  We  can  write  the  equation  of 
motion  as  mAvl  At  —  F{v)  and  use  separation  of  variables  to  get: 


(2.33) 


from  which  we  can  get  v{t )  after  inverting,  and  x[t )  after  integrating  u(  t)  as  in  equation  (2.21 ).  Alternatively, 
we  could  again  rewrite  our  equation  of  motion  as  an  equation  in  space  instead  of  time,  and  arrive  at: 

r  v'  ,  , 

x-x0  =  m\  ——Av.  (2.34) 

Jvo  F(V') 

From  equation  (2.3 4)  we  can  get  v(x)  by  inverting,  and  x( t)  from  equation  (2.32).  Note  that  equation  (2.34) 
does  not  give  us  x(t)  directly,  as  x  is  the  variable  in  that  equation. 

2.6.4.  Worked  example:  velocity  of  the  harmonic  oscillator 

It  may  seem  that  what  we’ve  done  so  far  in  this  section  has  hardly  helped  matters:  the  ‘solutions’  we  found 
contain  integrals  and  often  need  to  be  inverted  to  get  our  desired  function  x(t)  (or,  depending  on  the  problem 
we’re  studying,  v{t)  or  v{x)).  To  show  you  how  these  solutions  may  be  useful,  let’s  consider  a  specific  example: 
a  harmonic  oscillator,  consisting  of  a  mass  on  a  Hookean  spring,  with  F  -  h'(x)  =  -kx.  We  already  wrote  down 
the  equation  of  motion  (  )  and  its  general  solution  (  18).  The  general  solution  can  be  found  through  the 

substitution  of  exponentials,  as  we’ll  do  in  section  8.1.  However,  we  can  also  learn  something  useful  from 
writing  the  equation  of  motion  in  the  form  (2.3  ).  Its  solution,  formally  given  by  equation  (  ),  can  be 

calculated  explicitly  for  our  force  as 


^[i>2(x)- Vq]  =  f  {-kx')Ax'  =  -^[x2-Xq] 
2  Jx o  2 


(2.35) 


which  gives 


(2.36) 


for  v{x).  Although  x{t )  and  v(t)  are  more  easily  obtained  from  the  solution  given  in  equation  (2.18),  that 
solution  will  not  give  you  v{x),  and  deriving  it  is  tricky.  Here  we  get  it  almost  for  free.  Moreover,  as  you 
have  probably  noted,  equation  (2.3  )  relates  the  kinetic  to  the  potential  energy  of  the  harmonic  oscillator  -  a 
special  case  of  conservation  of  energy,  which  we’ll  discuss  in  the  next  section. 
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2.7.  Problems 

2.1  The  terminal  velocity  is  the  maximum  (constant)  velocity  a  dropping  object  reaches.  In  this  problem,  we 

use  equation  (  )  for  the  drag  force. 

(a)  Use  dimensional  analysis  to  relate  the  terminal  velocity  of  a  falling  object  to  the  various  relevant 
parameters. 

(b)  Estimate  the  terminal  velocity  of  a  paraglider  (figure  2.5c). 

(c)  Use  the  concept  of  terminal  velocity  to  predict  whether  a  mouse  (without  parachute)  is  likely  to  sur¬ 
vive  a  fall  from  a  high  tower. 

2.2  When  you  cook  rice,  some  of  the  dry  grains  always  stick  to  the  measuring  cup.  A  common  way  to  get 
them  out  is  to  turn  the  measuring  cup  upside-down  and  hit  the  bottom  (now  on  top)  with  your  hand  so 
that  the  grains  come  off  [  12] . 

(a)  Explain  why  static  friction  is  irrelevant  here. 

(b)  Explain  why  gravity  is  negligible. 

(c)  Explain  why  hitting  the  cup  works,  and  why  its  success  depends  on  hitting  the  cup  hard  enough. 

2.3  A  ball  is  thrown  at  speed  v  from  zero  height  on  level  ground.  We  want  to  find  the  angle  6  at  which  it 
should  be  thrown  so  that  the  area  under  the  trajectory  is  maximized. 

(a)  Sketch  of  the  trajectory  of  the  ball. 

(b)  Use  dimensional  analysis  to  relate  the  area  to  the  initial  speed  v  and  the  gravitational  acceleration  g. 

(c)  Write  down  the  x  and  y  coordinates  of  the  ball  as  a  function  of  time. 

(d)  Find  the  total  time  the  ball  is  in  the  air. 

(e)  The  area  under  the  trajectory  is  given  by  A  -  f  ydx.  Make  a  variable  transformation  to  express  this 
integral  as  an  integration  over  time. 

(f)  Evaluate  the  integral.  Your  answer  should  be  a  function  of  the  initial  speed  v  and  angle  0. 

(g)  From  your  answer  at  (f ) ,  find  the  angle  that  maximizes  the  area,  and  the  value  of  that  maximum  area. 
Check  that  your  answer  is  consistent  with  your  answer  at  (b) . 

2.4  If  a  mass  m  is  attached  to  a  given  spring,  its  period  of  oscillation  is  T.  If  two  such  springs  are  connected 
end  to  end,  and  the  same  mass  m  is  attached,  find  the  new  period  T' ,  in  terms  of  the  old  period  T. 

2.5  Two  blocks,  of  mass  m  and  2m,  are  connected  by  a  massless  string  and  slide  down  an  inclined  plane  at 
angle  0.  The  coefficient  of  kinetic  friction  between  the  lighter  block  and  the  plane  is  /i,  and  that  between 
the  heavier  block  and  the  plane  is  2/i.  The  lighter  block  leads. 

(a)  Find  the  magnitude  of  the  acceleration  of  the  blocks. 

(b)  Find  the  tension  in  the  taut  string. 

2.6  A  1000  kg  boat  is  traveling  at  100  km/h  when  its  engine  is  shut  off.  The  magnitude  F( j  of  the  drag  force 
between  the  boat  and  the  water  is  proportional  to  the  speed  v  of  the  boat,  with  a  drag  coefficient  (  - 
70  N  •  s/m.  Find  the  time  it  takes  the  boat  to  slow  to  45  km/h. 

2.7  Two  particles  on  a  line  are  mutually  attracted  by  a  force  F  =  -ar,  where  a  is  a  constant  and  r  the  distance 
of  separation.  At  time  t  —  0,  particle  A  of  mass  m  is  located  at  the  origin,  and  particle  B  of  mass  ml 4  is 
located  at  r  =  5.0  cm. 

(a)  If  the  particles  are  at  rest  at  t  =  0,  at  what  value  of  r  do  they  collide? 

(b)  What  is  the  relative  velocity  of  the  two  particles  at  the  moment  the  collision  occurs? 
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2.8  In  drag  racing,  specially  designed  cars  maximize  the  friction  with  the  road  to  achieve  maximum  acceler¬ 
ation.  Consider  a  drag  racer  (or  ‘dragster’)  as  shown  in  figure  2.8,  for  which  the  center  of  mass  is  close  to 
the  rear  wheels. 


Figure  2.8:  A  drag  racer  or  dragster  [  ] ,  CC  BY-SA  3.0. 


(a)  Draw  a  free-body  diagram  of  the  dragster  in  side  view.  Draw  the  wheels  as  circles,  and  approximate 
the  shape  of  the  dragster  body  as  a  triangle  with  a  horizontal  line  between  the  wheels,  a  vertical  line 
going  up  from  the  rear  axis,  and  a  diagonal  line  connecting  the  top  to  the  front  wheels.  NB:  consider 
carefully  the  direction  of  the  friction  force! 

(b)  On  which  of  the  wheels  is  the  frictional  force  the  largest? 

(c)  The  frictional  force  is  maximized  if  the  wheels  just  don’t  slip  (because,  as  usual,  the  coefficient  of 
kinetic  friction  is  smaller  than  that  of  static  friction).  Find  the  maximal  possible  frictional  force  on 
the  rear  wheels. 

(d)  Find  the  maximal  possible  acceleration  of  the  dragster. 

(e)  For  a  coefficient  of  (static)  friction  of  1.0  (a  fairly  realistic  value  for  rubber  and  concrete)  and  a  track  of 
500  m,  find  the  maximal  velocity  a  drag  racer  can  achieve  at  the  end  of  the  track  when  starting  from 
rest. 

2.9  Blocks  A,  B  and  C  are  placed  as  shown  in  the  figure,  and  connected  by  ropes  of  negligible  mass.  Both 
A  and  B  weigh  20.0  N  each,  and  the  coefficient  of  kinetic  friction  between  each  block  and  the  surface  is 
0.3.  The  slope’s  angle  6  equals  42.0°.  The  disks  in  the  pulleys  are  of  negligible  mass.  After  the  blocks  are 
released,  block  C  descends  with  constant  velocity. 


(a)  Find  the  tension  in  the  rope  connecting  blocks  A  and  B. 

(b)  What  is  the  weight  of  block  C? 

(c)  If  the  rope  connecting  blocks  A  and  B  were  cut,  what  would  be  the  acceleration  of  C? 
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2.10  The  figure  below  shows  a  common  present-day  seesaw  design.  In  addition  to  a  beam  with  two  seats,  this 
seesaw  also  contains  two  identical  springs  that  connect  the  beam  to  the  ground.  The  distance  between 
the  pivot  and  each  of  the  springs  is  30.0  cm,  the  distance  between  the  pivot  and  each  of  the  seats  is  1 .50  m. 


3.00  m 


(a)  A  4-year-old  weighing  20.0  kg  sits  on  one  of  the  seats,  causing  it  to  drop  by  20.0  cm.  Draw  a  free-body 
diagram  of  the  seesaw  with  the  child,  in  which  you  include  all  relevant  forces  (to  scale). 

(b)  Use  your  diagram  and  the  provided  data  to  calculate  the  spring  constant  of  the  two  springs  present 
in  the  seesaw. 

2.11  Two  marbles  of  identical  mass  m  and  radius  r  are  dropped  in  a  cylindrical  container  with  radius  3r,  as 
shown  in  the  figure.  Find  the  force  exerted  by  the  marbles  on  points  A,  B  and  C,  and  the  force  the  marbles 
exert  on  each  other. 
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2.12  Round  fruits  like  oranges  and  mandarins  are  typically  stacked  in  alternating  rows,  as  shown  in  figure 

Suppose  you  have  a  crate  with  a  square  base  that  is  exactly  five  oranges  wide.  You  stack  25  oranges  in  the 
crate,  then  put  another  16  on  top  in  the  holes,  and  then  add  a  second  layer  of  25  oranges,  held  in  place 
by  the  sides  of  the  crate.  Find  the  total  force  on  the  sides  of  the  crate  in  this  configuration.  Assume  all 
oranges  are  spheres  with  a  diameter  of  8.0  cm  and  a  mass  of  250  g. 
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Figure  2.9:  Stacked  fruit,  (a)  Stacked  mandarins  at  a  fruit  stand  [9].  (b)  Cross-section  of  stacked  oranges  in  a  crate. 
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2.13  Objects  with  densities  less  than  that  of  water  float,  and  even  objects  that  have  higher  densities  are  ‘lighter’ 
in  the  water.  The  force  that’s  responsible  for  this  is  known  as  the  buoyancy  force,  which  is  equal  but 
opposite  to  the  gravitational  force  on  the  displaced  water:  Ttmoyancy  =  PwgVw,  where  pw  is  the  water’s 
density  and  Vw  the  displaced  volume.  In  parts  (a)  and  (b),  we  consider  a  block  of  wood  with  density 
p  <  pw  which  is  floating  in  water. 

(a)  Which  fraction  of  the  block  of  wood  is  submerged  when  floating? 

(b)  You  push  down  the  block  somewhat  more  by  hand,  then  let  go.  The  block  then  oscillates  on  the 
surface  of  the  water.  Explain  why,  and  calculate  the  frequency  of  the  oscillation. 

(c)  You  take  out  the  piece  of  wood,  and  now  float  a  piece  of  ice  in  a  bucket  of  water.  On  top  of  the  ice,  you 
place  a  small  stone.  When  everything  has  stopped  moving,  you  mark  the  water  level.  Then  you  wait 
till  the  ice  has  melted,  and  the  stone  has  dropped  to  the  bottom  of  the  bucket.  What  has  happened 
to  the  water  level?  Explain  your  answer  (you  can  do  so  either  qualitatively  through  an  argument  or 
quantitatively  through  a  calculation). 

(d)  Rubber  ducks  also  float,  but,  despite  the  fact  that  they  have  a  flat  bottom,  they  usually  do  not  stay 
upright  in  water.  Explain  why. 

(e)  You  drop  a  5.0  kg  ball  with  a  radius  of  10  cm  and  a  drag  coefficient  Cd  of  0.20  in  water  (viscosity 
1.002  mPas).  This  ball  has  a  density  higher  than  that  of  water,  so  it  sinks.  After  a  while,  it  reaches  a 
constant  velocity,  known  as  its  terminal  velocity.  What  is  the  value  of  this  terminal  velocity? 

(f)  When  the  ball  in  (e)  has  reached  terminal  velocity,  what  is  the  value  of  its  Reynolds  number  (see 
problem  1.3)? 

2.14  A  uniform  stick  of  mass  M  and  length  L  =  1.00  m  has  a  weight  of  mass  m  hanging  from  one  end.  The 
stick  and  the  weight  hang  in  balance  on  a  force  scale  at  a  point  x  —  20.0  cm  from  the  end  of  the  stick.  The 
measured  force  equals  3.00  N.  Find  both  the  mass  M  of  the  stick  and  m  of  the  weight. 


a 


2.15  A  uniform  rod  with  a  length  of  4.25  m  and  a  mass  of  47.0  kg  is  attached  to  a  wall  with  a  hinge  at  one  end. 
The  rod  is  held  in  horizontal  position  by  a  wire  attached  to  its  other  end.  The  wire  makes  an  angle  of  30.0° 
with  the  horizontal,  and  is  bolted  to  the  wall  directly  above  the  hinge.  If  the  wire  can  support  a  maximum 
tension  of  1250  N  before  breaking,  how  far  from  the  wall  can  a  75.0  kg  person  sit  without  breaking  the 
wire? 

2.16  A  wooden  bar  of  uniform  density  but  varying  thickness  hangs  suspended  on  two  strings  of  negligible 
mass.  The  strings  make  angles  6\  and  82  with  the  horizontal,  as  shown.  The  bar  has  total  mass  m  and 
length  L.  Find  the  distance  x  between  the  center  of  mass  of  the  bar  and  its  (thickest)  end. 
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2.17  A  bicycle  wheel  of  radius  R  and  mass  M  is  at  rest  against  a  step  of  height  3R 14,  as  shown  in  the  figure. Find 
the  minimum  horizontal  force  F  that  must  be  applied  to  the  axle  to  make  the  wheel  start  to  rise  over  the 
step. 


F 


3R/4 


2.18  A  block  of  mass  M  is  pressed  against  a  vertical  wall,  with  a  force  F  applied  at  an  angle  9  with  respect  to 
the  horizontal  [-n/2  <9  <  n/2),  as  shown  in  the  figure.  The  friction  coefficient  of  the  block  and  the  wall 
is  p.  We  start  with  the  case  9-0,  i.e.,  the  force  is  perpendicular  to  the  wall. 


(a)  Draw  a  free-body  diagram  showing  all  forces. 

(b)  If  the  block  is  to  remain  stationary,  the  net  force  on  it  should  be  zero.  Write  down  the  equations 
for  force  balance  (i.e.,  the  sum  of  all  forces  is  zero,  or  forces  in  one  direction  equal  the  forces  in  the 
opposite  direction)  for  the  x  and  y  directions. 

(c)  From  the  two  equations  you  found  in  (b),  solve  for  the  force  F  needed  to  keep  the  book  in  place. 

(d)  Now  repeat  the  steps  you  took  in  (a)  -  (c)  for  a  force  under  a  given  angle  9,  and  find  the  required  force  F. 

(e)  For  what  angle  9  is  this  minimum  force  F  the  smallest?  What  is  the  corresponding  minimum  value  of 
F? 

(f)  What  is  the  limiting  value  of  9,  below  which  it  is  not  possible  to  keep  the  block  up  (independent  of 
the  magnitude  of  the  force)? 

2.19  A  spherical  stone  of  mass  m  =  0.250  kg  and  radius!?  =  5.0  cm  is  launched  vertically  from  ground  level  with 

an  initial  speed  of  Vq  —  15.0  m/s.  As  it  moves  upwards,  it  experiences  drag  from  the  air  as  approximated 

by  Stokes  drag,  F  —  6nrjR  v,  where  the  viscosity  77  of  air  is  1.002  mPa  ■  s. 

(a)  Which  forces  are  acting  on  the  stone  while  it  moves  upward? 

(b)  Using  Newton’s  second  law  of  motion,  write  down  an  equation  of  motion  for  the  stone  (this  is  a  dif¬ 
ferential  equation).  Be  careful  with  the  signs.  Flint:  Newton’s  second  law  of  motion  relates  force  and 
acceleration,  and  the  drag  force  is  in  terms  of  the  velocity.  What  is  the  relation  between  the  two? 
Simplify  the  equation  by  introducing  the  characteristic  time  t  =  (/”w . 

(c)  Find  a  particular  solution  vp  (t)  of  your  inhomogeneous  differential  equation  from  (  ). 

(d)  Find  the  solution  v^it)  of  the  homogeneous  version  of  your  differential  equation. 

(e)  Use  the  results  from  (  9c)  and  (19d)  and  the  initial  condition  to  find  the  general  solution  u(t)  of  your 
differential  equation. 

(f)  From(L9  ),  find  the  time  at  which  the  stone  reaches  its  maximum  height. 

(g)  From  v{t),  find  h{t)  for  the  stone  (height  as  a  function  of  time). 

(h)  Using  your  answers  to  ( I9f)  and  ( 19g),  find  the  maximum  height  the  stone  reaches. 
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3.1.  Work 

How  much  work  do  you  need  to  do  to  move  a  box?  Well,  that  depends  on  two  things:  how  heavy  the  box  is, 
and  how  far  you  have  to  move  it.  Multiply  the  two,  and  you’ve  got  a  good  measure  of  how  much  work  will  be 
required.  Of  course,  work  can  be  done  in  other  contexts  as  well  -  pulling  a  spring  from  equilibrium,  or  cycling 
against  the  wind.  In  each  case,  there’s  a  force  and  a  displacement.  To  be  fair,  we  will  only  count  the  part  of  the 
force  that  is  in  the  direction  of  the  displacement  (when  cycling,  you  don’t  do  work  due  to  the  fact  that  there’s 
a  gravitational  force  pulling  you  down,  since  you  don’t  move  vertically;  you  do  work  because  there’s  a  drag 
force  due  to  your  moving  through  the  air).  We  define  work  as  the  product  of  the  component  of  the  force  in 
the  direction  of  the  displacement,  times  the  displacement  itself.  We  calculate  this  component  by  projecting 
the  force  vector  on  the  displacement  vector,  using  the  dot  product  (see  appendix  for  an  introduction  in  to 
vector  math): 

W  =  F-x.  (3.1) 

Note  that  work  is  a  scalar  quantity  -  it  has  a  magnitude  but  no  direction.  Work  is  measured  in  Joules  (J),  with 
one  Joule  being  equal  to  one  Newton  times  one  meter. 

Of  course  the  force  acting  on  our  object  need  not  be  constant  everywhere.  Take  for  example  the  extension 
of  a  spring:  the  further  you  pull,  the  larger  the  force  gets,  as  given  by  Hooke’s  law  (  ).  To  calculate  the  work 

done  when  extending  the  spring,  we  chop  up  the  path  (here  a  straight  line)  into  many  small  pieces.  For  each 
piece,  we  approximate  the  force  by  the  average  value  on  that  piece,  then  multiply  with  the  length  of  the  piece 
and  sum.  In  the  limit  that  we  have  infinitely  many  pieces,  this  approximation  becomes  exact,  and  the  sum 
becomes  an  integral:  for  one  dimension,  we  thus  have: 

r»x 2 

W  =  /  F{x)dx.  (3.2) 

Jx  1 

Likewise,  the  path  along  which  we  move  need  not  be  a  straight  line.  If  the  path  consists  of  multiple  straight 
segments,  on  each  of  which  the  force  is  constant,  we  can  calculate  the  total  work  by  adding  the  work  done 
on  the  different  segments.  Taking  the  limit  to  infinitely  many  infinitesimally  small  segments  dr,  on  each  of 
which  the  force  is  given  by  the  value  F(r),  the  sum  again  becomes  an  integral: 

rr2 

W=  F(r)  •  dr.  (3.3) 

Jr  i 

Equation  (3.3)  is  the  most  general  version  of  the  definition  of  work;  it  simplifies  to  (3.2)  for  movement  along 
a  straight  line,  and  to  (3.1)  if  both  the  path  is  straight  and  the  force  constant  . 

1  If  you  feel  intimidated  by  the  vector  form  of  equation  (3.3),  it  may  help  to  rewrite  it  in  terms  of  the  magnitudes  of  the  force  F(r)  and 
the  (infinitesimal)  displacement  dr,  and  the  angle  6  between  them.  In  terms  of  F  =  |F|,  dr  =  |dr  and  9,  we  have  F  •  dr  =  F  cos  6  dr,  an 
expression  you  may  have  seen  before  for  a  force  not  pointing  in  the  same  direction  as  the  displacement.  If  we  now  make  the  force  and 
displacements  functions  of  the  position  r,  then  so  become  the  magnitude  of  the  force  and  the  angle,  so  we  can  also  write  equation  (3.3) 
as 

rr 2 

W=\  F(r)cos0(r)  •  dr.  (3.4) 

Jr\ 
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In  general,  the  work  done  depends  on  the  path  taken  -  for  example,  it’s  more  work  to  take  a  detour  when 
biking  from  home  to  work,  assuming  the  air  drag  is  the  same  everywhere.  However,  in  many  important  cases 
the  work  done  in  getting  from  one  point  to  another  depends  on  the  endpoints  only.  Forces  for  which  this  is 
true  are  called  conservative  forces.  As  we’ll  see  below,  the  force  exerted  by  a  spring  and  that  exerted  by  gravity 
are  both  conservative. 

Sometimes  we  will  not  be  interested  in  how  much  work  is  done  in  generating  a  certain  displacement,  but 
over  a  certain  amount  of  time  -  for  instance,  a  generator  generates  work  by  getting  something  to  move,  like 
a  wheel  or  a  valve,  but  we  don’t  typically  care  about  those  details,  we  want  to  know  how  much  work  we  can 
expect  to  get  out  of  the  generator,  i.e.,  how  much  power  it  has.  Power  is  defined  as  the  amount  of  work  per 
unit  time,  or 


d  W 
df 


(3.5) 


Power  is  measured  in  Joules  per  second,  or  Watts  (W).  To  find  out  how  much  work  is  done  by  an  engine  that 
has  a  certain  power  output,  we  need  to  integrate  that  output  over  time: 


W  = 


(3.6) 


3.2.  Kinetic  energy 

Newton’s  first  law  told  us  that  a  moving  object  will  stay  moving  unless  a  force  is  acting  on  it  -  which  holds 
for  moving  with  any  speed,  including  zero.  Now  if  you  want  to  start  moving  something  that  is  initially  at 
rest,  you’ll  need  to  accelerate  it,  and  Newton’s  second  law  tells  you  that  this  requires  a  force  -  and  moving 
something  means  that  you’re  displacing  it.  Therefore,  there  is  work  involved  in  getting  something  moving. 
We  define  the  kinetic  energy  {K)  of  a  moving  object  to  be  equal  to  the  work  required  to  bring  the  object  from 
rest  to  that  speed,  or  equivalently,  from  that  speed  to  rest: 

K -  -mv2.  (3.7) 

2 

Because  the  kinetic  energy  is  equal  to  an  amount  of  work,  it  is  also  a  scalar  quantity,  has  the  same  dimension, 
and  is  measured  in  the  same  unit.  The  factor  v2  is  the  square  of  the  magnitude  of  the  velocity  of  the  moving 
object,  which  you  can  calculate  with  the  dot  product:  v2  —  v-v.  You  may  wonder  where  equation  (3.7)  comes 
from.  Newton’s  second  law  tells  us  that  F  =  mdvl  dt,  relating  the  force  to  an  infinitesimal  change  in  the 
velocity.  In  the  definition  for  work,  equation  (3.:  ),  we  multiply  the  force  with  an  infinitesimal  change  in  the 
position  dr.  That  infinitesimal  displacement  takes  an  infinitesimal  amount  of  time  df,  which  is  related  to 
the  displacement  by  the  instantaneous  velocity  v:  dr  =  vdt.  We  can  now  calculate  the  work  necessary  to 
accelerate  from  zero  to  a  finite  speed: 


K  = 


f  f  dv 

r  dv 

f  m  f 

/  F  ■  dr  -  /  m vdt  — 

/  mv - df  = 

|  mv-  dv  =  —  J 

J  J  dt  J 

'  dt  J 

'  2  J 

d{v-  v)  =  -  mv2 
2 


(3.8) 


where  we  used  that  the  dot  product  is  commutative  and  the  fact  that  the  integral  over  the  derivative  of  a 
function  is  the  function  itself. 

Of  course,  now  that  we  know  that  the  kinetic  energy  is  given  by  equation  (3.7),  we  no  longer  need  to  use  a 
complicated  integral  to  calculate  it.  However,  because  the  kinetic  energy  is  ultimately  given  by  this  integral, 
which  is  equal  to  a  net  amount  of  work,  we  arrive  at  the  following  statement,  sometimes  referred  to  as  the 
Work-energy  theorem:  the  change  in  kinetic  energy  of  a  system  equals  the  net  amount  of  work  done  on  or 
by  it  (in  case  of  increase/ decrease  of  K) : 

\K  -  IVnet- 


(3.9) 


3.2.  Kinetic  energy 
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Figure  3.1:  Examples  of  high  power  resulting  in  high  kinetic  energy,  (a)  Running  cheetah,  the  fastest  land  animal,  which  can  reach  speeds 
over  100  km/h  in  2-3  seconds,  corresponding  to  an  enormous  increase  in  its  kinetic  energy  [10],  CC  BY-SA  3.0.  (b)  Allyson  Felix  running 
second  in  the  women’s  4  x  400  relay  of  the  2012  London  Summer  Olympics  [1 1],  CC  BY-SA  3.0.  (c)  Robert  Garrett  preparing  to  throw  the 
discus  at  the  1896  Athens  Summer  Olympics  [12].  Unlike  the  runners,  the  goal  of  discus  throwing  is  to  maximize  the  distance,  not  the 
speed,  but  to  get  the  largest  possible  distance,  the  discus  must  still  get  the  maximal  possible  kinetic  energy. 
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3.3.  Potential  energy 

We  already  encountered  conservative  forces  in  section  3.  .  The  work  done  by  a  conservative  force  is  (by  defi¬ 
nition)  path-independent;  that  means  that  in  particular  the  work  done  when  moving  along  any  closed  path 
must  be  zero: 

<j>  F-  dr  =  0.  (3.10) 

For  a  conservative  force,  we  can  thus  define  a  potential  energy  difference  between  points  1  and  2  as  the  work 
necessary  to  move  an  object  from  point  1  to  point  2: 


pr2 

\U\2  —  —  I  Fdr.  (3.11) 

Jri 

Note  the  minus  sign  in  the  definition  -  this  is  a  choice  of  course,  and  you’ll  see  below  why  we  made  this  choice. 
Note  also  that  the  potential  energy  is  defined  only  between  two  points.  Often  we  will  choose  a  convenient 
reference  point  and  calculate  the  potential  energy  at  any  other  point  with  respect  to  that  point.  The  reference 
point  is  typically  either  the  origin  or  infinity,  if  the  force  happens  to  be  zero  at  either  of  these.  Let’s  suppose 
we  have  set  such  a  point,  and  know  the  potential  energy  difference  with  that  point  at  any  other  point  in  space 
-  this  defines  a  (scalar)  function  U (r).  If  we  now  want  to  know  the  force  acting  on  a  particle  at  r,  all  we  need 
to  do  is  take  the  derivative  of  U{r )  -  that  is  to  say  the  gradient  in  three  dimensions  (which  simplifies  to  the 
ordinary  derivative  in  one  dimension): 

F(r)  =  -Vf/(r).  (3.12) 

Equation  (3.12)  is  extremely  useful,  as  it  gives  us  a  means  to  calculate  the  force,  which  is  a  vector  quantity, 
from  the  potential  energy  function,  which  is  a  scalar  quantity  -  and  therefore  much  simpler  to  work  with. 
For  instance,  since  energies  are  scalars,  they  can  simply  be  added,  as  we’ll  do  in  the  next  section,  whereas 
for  forces  you  need  to  do  vector  addition.  Equation  (3.12)  also  reflects  that  we  are  free  to  choose  a  reference 
point  for  the  potential  energy,  since  the  force  does  not  change  if  we  add  a  constant  to  the  potential  energy. 

3.3.1.  Gravitational  potential  energy 

We  saw  in  section  that  for  low  altitudes,  the  gravitational  force  is  given  by  Fg  =  mg,  where  g  is  a  vector 
of  constant  magnitude  g  ~  9.81  m/s2  and  always  points  down.  Therefore,  the  gravitational  force  does  no  work 
when  you  move  horizontally,  and  if  you  first  move  up  and  then  the  same  amount  down  again,  it  doesn’t  do 
any  net  work  either,  as  the  two  contributions  exactly  cancel.  Fg  is  therefore  an  example  of  a  conservative 
force,  and  we  can  define  and  calculate  the  gravitational  potential  energy  Ug  between  a  point  at  height  0  (our 
reference  point)  and  one  at  height  h: 


Ug{h)  =  - 


m[-g )  dz  =  mgh. 


(3.13) 


Note  that  by  choosing  a  minus  sign  in  the  definition  of  the  potential  energy,  we  end  up  with  a  positive  value 
of  the  energy  here. 

What  about  larger  distances,  i.e.,  Newton’s  law  of  gravity,  equation  (2.9)?  Well,  there  the  distances  are 
measured  radially,  so  any  movement  perpendicular  to  the  radial  direction  doesn’t  matter,  and  if  you  move  out 
and  back  in  again,  the  net  work  done  is  zero,  so  by  the  same  reasoning  as  before  we  again  have  a  conservative 
force.  This  force  vanishes  at  infinity,  so  it  makes  sense  to  set  that  as  a  reference  point  -  though  notice  that  that 
will  make  our  potential  energy  always  negative  in  this  case: 


UG(r)  = 


GMm 

r 


(3.14) 


where  r  is  the  distance  between  m  and  M,  and  M  sits  at  the  origin.  Of  course  we  can  also  calculate  gravita¬ 
tional  potential  differences  between  two  distances  r\  and  r2  from  M:  A Uc, ( r i ,  L?)  =  GMml  j —  Y  . 


2The  integral  sign  with  the  circle  in  equation  (  )  represents  an  integral  over  a  closed  path. 


3.3.  Potential  energy 
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Emmy  Noether  (1882-1935)  was  a  German  mathematician,  who  made  key 
contribution  both  to  the  development  of  abstract  algebra  and  to  ideas  in 
theoretical  physics.  In  physics,  she  uncovered  a  deep  connection  between 
symmetry  and  conservation  laws  (now  known  as  Noether’s  theorem,  con¬ 
sidered  by  many  as  the  most  important  theorem  for  the  development  of 
modern  physics):  for  every  continuous  symmetry  of  a  system,  there  exists 
a  conserved  quantity.  A  continuous  symmetry  is  one  that  leaves  a  sys¬ 
tem  invariant  for  an  arbitrarily  large  given  transformation;  for  example, 
the  rotation  of  a  circle  under  any  angle.  Applications  of  Noether’s  theorem 
include  conservation  of  energy  (corresponding  to  invariance  under  time 
translation,  i.e.,  it  doesn’t  matter  where  you  set  t  -  0,  section  3.  ),  conser¬ 
vation  of  momentum  (invariance  under  space  translation,  i.e.,  it  doesn’t 
matter  where  you  put  the  origin,  section  4.2)  and  conservation  of  angu- 


I 


lar  momentum  (invariance  under  space  rotation,  i.e.,  it  doesn’t  matter  in 
which  direction  you  choose  your  x-axis,  section  ).  Similar  conserva¬ 
tion  laws  are  found  in  special  and  general  relativity,  quantum  mechan-  Figure  3.2:  Emmy  Noether  [13], 
ics,  and  quantum  field  theory.  Unfortunately,  even  in  the  early  20th  cen¬ 
tury,  women  were  still  excluded  from  most  academic  positions.  Noether 
therefore  initially  worked  for  free  at  the  university  of  Erlangen,  getting  a 
paid  position  in  Gottingen  in  1915  at  the  invitation  of  Hilbert  and  Klein, 
who  had  both  been  convinced  by  the  quality  of  her  work.  Her  fame  grew 
through  the  1910s  and  1920s,  gaining  worldwide  recognition.  Due  to  her 
Jewish  descent,  she  was  dismissed  from  her  academic  position  by  the  Nazi 
government  in  1933,  and  moved  to  the  United  States,  where  she  died  two 
years  later  at  age  53.  Various  institutes  and  scholarship  programs,  mostly 
in  Germany,  are  now  named  in  her  honor. 


3.3.2.  Spring  potential  energy 

Like  the  gravitational  force,  the  Hookean  spring  force  (  )  also  depends  on  displacement  alone,  and  by  the 

same  reasoning  is  conservative  (notice  the  pattern?).  Calculating  its  associated  potential  energy  is  straight¬ 
forward,  and  taking  the  equilibrium  position  of  the  spring  as  the  reference  point,  we  find: 


(3.15) 


The  minus  sign  in  Hooke’s  Law  gives  us  a  positive  spring  potential  energy.  Note  that  x  stands  for  displacement 
here;  as  we  only  consider  one-dimensional  springs  the  lD-version  is  sufficient. 

3.3.3.  General  conservative  forces 

In  the  case  of  the  gravitational  and  spring  force  it  was  easy  to  reason  that  they  had  to  be  conservative.  It  is  also 
easy  to  see  that  the  friction  force  is  not  conservative:  if  you  take  a  longer  path,  you  need  to  do  more  net  work 
against  friction,  which  you  can  moreover  never  recover  as  mechanical  energy.  For  more  complicated  systems, 
especially  in  three  dimensions,  it  may  not  be  so  easy  to  see  whether  a  force  is  conservative.  Fortunately  there 
is  an  easy  test  you  can  perform:  if  the  curl  of  a  force  is  zero  everywhere,  it  will  be  a  conservative  force,  or 
expressed  mathematically: 


(3.16) 


Is  is  straightforward  to  show  that  if  a  force  is  conservative,  its  curl  must  vanish:  a  conservative  force  can  be 
written  as  the  gradient  of  some  scalar  function  U{x),  and  V  x  V  U(x)  -  0  for  any  function  U{x),  as  you  can 
easily  check  for  yourself.  The  proof  the  other  way  around  is  more  complicated,  and  can  be  found  in  advanced 
mechanics  textbooks. 
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3.4.  Conservation  of  energy 

Work,  kinetic  energy  and  potential  energy  are  all  quantities  with  the  same  dimension  -  so  we  can  do  arith¬ 
metic  with  them.  One  particularly  useful  quantity  is  the  total  energy  E  of  a  system,  which  is  simply  the  sum 
of  the  kinetic  and  potential  energy: 

E-K+U.  (3.17) 


Theorem  3.1  (Law  of  conservation  of  energy).  If  all  forces  in  a  system  are  conservative,  the  total  energy  in  that 
system  is  conserved. 


Proof.  For  simplicity,  we’ll  look  at  the  ID  case  (3D  goes  analogously).  Conserved  means  not  changing  in 
time,  so  in  order  to  prove  the  statement,  we  only  need  to  calculate  the  time  derivative  of  E  and  check  that  it 
is  always  zero. 


d  E 
dt 


d  K  d  U 
df  dt 

d [\mv2]  d U  dx 

— - + - 

df  dx  df 

df 

mv - Fv 

dt 


v 


=  0  by  Newton’s  second  law. 


(3.18) 


□ 


Conservation  of  energy  means  that  the  total  energy  of  a  system  cannot  change,  but  of  course  the  potential 
and  kinetic  energy  can  -  and  by  conservation  of  total  energy  we  know  that  they  get  converted  directly  into  one 
another.  Exploiting  this  fact  will  allow  us  to  analyze  and  easily  solve  many  problems  in  classical  mechanics  - 
this  conservation  law  is  an  immensely  useful  tool. 

Note  that  conservation  of  energy  is  not  the  same  as  the  work-energy  theorem  of  section  .  For  the  total 
energy  to  be  conserved,  all  forces  need  to  be  conservative.  In  the  work-energy  theorem,  this  is  not  the  case. 
You  can  therefore  calculate  changes  in  kinetic  energy  due  to  the  work  done  by  non-conservative  forces  using 
the  latter. 


U(x) 


Figure  3.3:  An  example  of  a  potential  energy  landscape.  In  this  figure,  the  total  energy  would  be  represented  by  a  horizontal  line;  the 
kinetic  energy  by  the  distance  between  the  potential  and  total  energy.  Equlibrium  points  (dots)  occur  at  extrema  of  the  potential  energy, 
when  its  derivative  (the  force)  is  zero.  The  green  dots  indicate  unstable  equilibrium  points  (maxima,  where  the  second  derivative  is 
negative),  the  orange  points  metastable  equilibria  (local  minima)  and  the  red  point  the  single  globally  stable  equilibrium  of  this  system. 


3.5.  Energy  landscapes 
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3.5.  Energy  landscapes 

In  the  previous  section  we  proved  that  the  total  energy  is  conserved.  In  the  section  before  that,  we  looked  at 
potential  energies.  Typically,  the  potential  energy  is  a  function  of  your  position  in  space.  When  we  plot  it  as 
a  function  of  spatial  coordinates,  we  get  an  energy  landscape,  measuring  an  amount  of  energy  on  the  vertical 
axis.  Of  course  we  can  also  plot  the  total  energy  of  the  system  -  and  since  that  is  conserved,  it  is  the  same 
everywhere,  and  thus  becomes  a  horizontal  line  or  plane.  Because  kinetic  energy  cannot  be  negative,  any 
point  where  the  potential  energy  is  higher  than  the  total  energy  is  not  allowed:  the  system  cannot  reach  this 
point.  When  the  potential  energy  equals  the  total  energy,  the  kinetic  energy  (and  thus  the  speed)  has  to  be 
zero.  Whenever  the  potential  energy  is  lower  than  the  total  energy,  there  is  a  positive  kinetic  energy  and  thus 
a  positive  speed. 

Probably  the  simplest  energy  landscape  is  that  of  the  harmonic  oscillator  (mass  on  a  spring)  -  it’s  a  simple 
parabola.  The  point  at  which  the  horizontal  line  representing  the  total  energy  crosses  the  parabola  corre¬ 
sponds  to  the  extrema  of  the  oscillation:  these  are  its  turning  points.  The  bottom  of  the  parabola  is  its  mid¬ 
point,  and  you  can  immediately  see  that  that’s  where  the  kinetic  energy  (and  thus  the  speed)  will  be  highest. 

Of  course  you  can  have  more  complex  energy  landscapes  than  that.  In  particular,  you  can  have  a  land¬ 
scape  with  multiple  extrema,  see  for  example  figure  .  A  particle  that  is  being  acted  upon  by  forces  described 
by  this  potential  energy,  follows  a  trajectory  in  this  landscape,  which  can  be  visualized  as  a  ball  rolling  over 
the  hills  and  valleys  of  the  landscape.  Think  back  to  the  harmonic  oscillator  example.  If  we  let  go  of  a  ball  in  a 
parabolic  vase  at  some  point  on  the  slope,  the  ball  will  roll  down  and  pick  up  speed,  then  roll  up  the  opposite 
slope  and  loose  speed,  until  it  reaches  the  same  height  where  its  speed  will  again  be  zero.  The  same  is  true 
in  more  complicated  landscapes.  Particularly  interesting  are  local  maxima.  If  you  put  a  ball  exactly  on  top  of 
one  of  them,  it  will  stay  there  -  it  is  a  fixed  point,  but  an  unstable  one,  as  any  arbitrarily  small  perturbation  will 
push  it  down.  If  you  let  go  of  a  ball  at  a  level  above  a  local  maximum,  it  may  hop  over  it  to  the  next  minimum, 
but  if  your  initial  position  (your  initial  energy)  was  too  low,  your  ball  can  get  stuck  oscillating  about  a  local 
minimum  -  a  metastable  point. 

Energy  landscapes  are  even  useful  when  the  total  energy  is  not  conserved  -  for  example  because  of  friction 
terms.  Friction  causes  energy  to  dissipate  from  the  system,  which  is  equivalent  to  having  your  ball  move  in 
the  landscape  with  friction.  For  low  friction,  your  ball  will  oscillate,  but  get  less  high  every  time,  until  it  comes 
to  rest  at  the  minimum.  For  high  friction,  it  won’t  even  oscillate,  but  just  get  to  the  minimum  -  exactly  what 
an  overdamped  system  in  real  life  does. 


3.5.1.  Worked  example:  The  Lennard-Jones  potential 

The  Lennard-Jones  potential  energy  is  a  commonly  used  model  to  describe  the  interactions  between  un¬ 
charged  atoms  and  molecules.  This  potential  energy  can  be  written  in  two  equivalent  ways: 


Uh]{r)  = 


(3.19) 


where  r  is  the  distance  between  the  atoms  or  molecules,  and  A,  B,  e  and  a  are  positive  constants. 

(a)  Find  the  dimensions  of  A,  B,  e  and  <r. 

(b)  Express  e  and  a  in  A  and  B. 

(c)  Sketch  the  potential  (in  its  second  form)  as  a  function  of  r/cr,  and  use  this  sketch  to  give  a  physical  inter¬ 
pretation  of  e  and  a. 

(d)  Does  the  Lennard-Jones  potential  lead  to  attractive  or  repulsive  forces  at  short  distances?  And  what  about 
long  distances? 

(e)  Find  all  equilibrium  points  of  this  potential  energy,  and  determine  their  stability. 


Solution 

(a)  [U]  =  Energy^  [[/]  =  M  x  Tj  x  L  = 

[A]  =  Energy  x  Length12  =>  [A]  = 

[£]  =  Energy  x  Length6  =>  [A]  = 

Because  the  powers  of  the  terms  ((y)12)  and  ((y)6)  are  different,  while  we  add  them  together,  they  have 
to  be  dimensionless,  so  [cr]  =  L  and  [e]  =  [U]  —  . 
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(b) 


4ea12  =  A  and  4ea6  -  B 


By  substituting  o  in  the  expressions  for  either  A  or  B  we  can  derive  an  expression  for  e: 

4a6  e  =  B 

2  B2 

4  eA  =  Bz^>e=  — . 

4  A 


Figure  3.4:  Sketch  of  the  Lennard-Jones  potential  energy. 


(c)  See  figure  3.4.  Interpretation:  e  is  a  measure  for  the  depth  of  the  potential  well,  a  sets  the  length  scale  and 
therefore  the  position  of  the  equilibrium  point. 

(d)  Method  1 :  We  calculate  the  force  as  minus  the  derivative  of  the  potential  energy: 

dU  (12a12  6  a6 

dr  {  r13  r7 

For  small  r  we  have  r-13  >>  r~7 ,  so  F  is  positive  and  therefore  repulsive.  Conversely,  for  large  r  we  have 
r-13  <<  r~7 ,  so  F  is  negative  and  therefore  attractive. 

Method  2:  Use  the  sketch  in  (c)  to  to  see  that  the  slope  of  the  potential  is  negative  for  small  r,  which 
implies  a  repulsive  force,  and  the  slope  of  the  potential  is  positive  for  large  r,  which  implies  an  attractive 
force. 


(e)  For  an  equilibrium  point  we  have: 


dU  (12  a12 

~dr  ~  4e  (  ri3 


6(7 


6'  ea6  (2a6 

=  247m76”-1I’ 


(3.20) 


so  there  is  only  one  equilibrium  point,  at 


req  =  21,ba. 


To  determine  the  stability  at  this  point,  we  consider  the  second  derivative  of  U{r)\ 


(3.21) 


d2U 

~dr2 


-Beq 


a6  a12 
=  4e  142— g-  -  156— jj 


42 


156 


r=req  4Cl2  4/3ff2  2 7/3CT2 


=  _36-22/3  —  <0, 

(7Z 


(3.22) 


which  means  that  the  equilibrium  point  is  stable.  Alternatively,  we  could  have  determined  the  stability 
by  considering  the  graph  drawn  at  (c),  from  which  we  can  see  that  the  equilibrium  point  corresponds  to 
a  global  minimum  of  the  potential  energy  and  hence  is  stable. 


3.6.  Problems 
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3.6.  Problems 

3.1  (a)  Show  that,  if  you  ignore  drag,  a  projectile  fired  at  an  initial  velocity  vq  and  angle  6  has  a  range  R  given 

by 

t/nSin20 

R=— - .  (3.23) 

g 

(b)  A  target  is  situated  1.5  km  away  from  a  cannon  across  a  flat  held.  Will  the  target  be  hit  if  the  bring 
angle  is  42°  and  the  cannonball  is  fired  at  an  initial  velocity  of  121  m/s?  (Cannonballs,  as  you  know, 
do  not  bounce). 

(c)  To  increase  the  cannon’s  range,  you  put  it  on  a  tower  of  height  ho.  Find  the  maximum  range  in  this 
case,  as  a  function  of  the  bring  angle  and  velocity,  assuming  the  land  around  is  still  bat. 

3.2  You  push  a  box  of  mass  m  up  a  slope  with  angle  6  and  kinetic  friction  coefficient  p.  Find  the  minimum 
initial  speed  v  you  must  give  the  box  so  that  it  reaches  a  height  h. 

3.3  A  uniform  board  of  length  L  and  mass  M  lies  near  a  boundary  that  separates  two  regions.  In  region  1,  the 
coefficient  of  kinetic  friction  between  the  board  and  the  surface  is  fi\,  and  in  region  2,  the  coefficient  is 
P2-  Our  objective  is  to  bnd  the  net  work  W  done  by  friction  in  pulling  the  board  directly  from  region  1  to 
region  2,  under  the  assumption  that  the  board  moves  at  constant  velocity. 

Before  , 


M, 

m2 

region  1 

region  2 

During 

X 

M, 

m2 

After 

Mi 

m2 

(a)  Suppose  that  at  some  point  during  the  process,  the  right  edge  of  the  board  is  a  distance  x  from  the 
boundary,  as  shown.  When  the  board  is  at  this  position,  what  is  the  magnitude  of  the  force  of  fric¬ 
tion  acting  on  the  board,  assuming  that  it’s  moving  to  the  right?  Express  your  answer  in  terms  of  all 
relevant  variables  (L,  M,  g,  x,  jj\ ,  and  p2)- 

(b)  As  we’ve  seen  in  section  .  1 ,  when  the  force  is  not  constant,  you  can  determine  the  work  by  integrating 
the  force  over  the  displacement,  W  =  f  F[x)  dx.  Integrate  your  answer  from  (a)  to  get  the  net  work 
you  need  to  do  to  pull  the  board  from  region  1  to  region  2. 

3.4  The  government  wishes  to  secure  votes  from  car-owners  by  increasing  the  speed  limit  on  the  highway 
from  120  to  140  km/h.  The  opposition  points  out  that  this  is  both  more  dangerous  and  will  cause  more 
pollution.  Lobbyists  from  the  car  industry  tell  the  government  not  to  worry:  the  drag  coefficients  of  the 
cars  have  gone  down  signibcantly  and  their  construction  is  a  lot  more  solid  than  in  the  time  that  the 
120  km/h  speed  limit  was  set. 

(a)  Suppose  the  120  km/h  limit  was  set  with  a  Volkswagen  Beetle  [Cd  =  0.48)  in  mind,  and  the  lobbyist’s 
car  has  a  drag  coefficient  of  0. 19.  Will  the  new  car  need  to  do  more  or  less  work  to  maintain  a  constant 
speed  of  140  km/h  than  the  Beetle  at  120  km/h? 

(b)  What  is  the  ratio  of  the  total  kinetic  energy  released  in  a  full  head-on  collision  (resulting  in  an  imme¬ 
diate  standstill)  between  two  cars  both  at  140  km/h  and  two  cars  both  at  120  km/h? 

(c)  The  government  dismisses  the  opposition’s  objections  on  safety  by  stating  that  on  the  highway,  all 
cars  move  in  the  same  direction  (opposite  direction  lanes  are  well  separated),  so  if  they  all  move  at 
140  km/h,  it  would  be  just  as  safe  as  all  at  120  km/h.  The  opposition  then  points  out  that  running  a 
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Beetle  (those  are  still  around)  at  120  km/h  is  already  challenging,  so  there  would  be  speed  differences 
between  newer  and  older  cars.  The  government  claims  that  the  20  km/h  difference  won’t  matter,  as 
clearly  even  a  Beetle  can  survive  a  20  km/h  collision.  Explain  why  their  argument  is  invalid. 

3.5  Nuclear  fusion,  the  process  that  powers  the  Sun,  occurs  when  two  low-mass  atomic  nuclei  fuse  together 
to  make  a  larger  nucleus,  releasing  substantial  energy.  Fusion  is  hard  to  achieve  because  atomic  nuclei 
carry  positive  electric  charge,  and  their  electrical  repulsion  makes  it  difficult  to  get  them  close  enough  for 
the  short-range  nuclear  force  to  bind  them  into  a  single  nucleus.  The  figure  below  shows  the  potential- 
energy  curve  for  fusion  of  two  deuterons  (heavy  hydrogen  nuclei,  consisting  of  a  proton  and  a  neutron). 
The  energy  is  measured  in  million  electron  volts  (MeV,  1  eV  =  1.6  •  10-19  J),  a  unit  commonly  used  in 
nuclear  physics,  and  the  separation  is  in  femtometers  (1  fm  =  10-15  m). 


U  (MeV) 


(a)  Find  the  position(s)  (if  any)  at  which  the  force  between  two  deuterons  is  zero. 

(b)  Find  the  kinetic  energy  two  initially  widely  separated  deuterons  need  to  have  to  get  close  enough  to 
fuse. 

(c)  The  energy  available  in  fusion  is  the  energy  difference  between  that  of  widely  separated  deuterons 
and  the  bound  deutrons  after  they’ve  ‘fallen’  into  the  deep  potential  well  shown  in  the  figure.  About 
how  big  is  that  energy? 

(d)  Determine  whether  the  force  between  two  deuterons  that  are  4  fm  apart  is  repulsive,  attractive,  or 
zero. 

3.6  A  pigeon  in  flight  experiences  a  drag  force  due  to  air  resistance  given  approximately  by  F  —  bv2,  where  u 
is  the  flight  speed  and  b  is  a  constant. 

(a)  What  are  the  units  of  bl 

(b)  What  is  the  largest  possible  speed  of  the  pigeon  if  its  maximum  power  output  is  PI 

(c)  By  what  factor  does  the  largest  possible  speed  increase  if  the  maximum  power  output  is  doubled? 

3.7  (a)  For  which  value (s)  of  the  parameters  a,  p  and  y  is  the  force  given  by 

F  -  U3y3  +  az2,  /3x4y2,jxz) 

conservative? 

(b)  Find  the  force  for  the  potential  energy  given  by  U [x, y, z)  =  xy/z-  xzly. 

3.8  A  point  mass  is  connected  to  two  opposite  walls  by  two  springs,  as  shown  in  the  figure.  The  distance 
between  the  walls  is  2 L.  The  left  spring  has  rest  length  l\  =  LI 2  and  spring  constant  k\  =  k,  the  right 
spring  has  rest  length  I2  =  3L/4  and  spring  constant  k-2  —  3k. 

2  L 


K  k2 

^WWWWKHM/WWW- 

x  = 

-l  hi 

=  L 

x  =  -L 


x  —  L 
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(a)  Determine  the  magnitude  of  the  force  acting  on  the  point  mass  if  it  is  at  x  —  0. 

(b)  Determine  the  equilibrium  position  of  the  point  mass. 

(c)  Find  the  potential  energy  of  the  point  mass  as  a  function  of  x.  Use  the  equilibrium  point  from  (b)  as 
your  point  of  reference. 

(d)  If  the  point  mass  is  displaced  a  small  distance  from  its  equilibrium  position  and  then  released,  it 
will  oscillate.  By  comparing  the  equation  of  the  net  force  on  the  mass  in  this  system  with  a  simple 
harmonic  oscillator,  determine  the  frequency  of  that  oscillation.  (We’ll  return  to  systems  oscillating 
about  the  minimum  of  a  potential  energy  in  section  8.1.4,  feel  free  to  take  a  sneak  peak  ahead). 

3.9  A  block  of  mass  m  -  3.50  kg  slides  from  rest  a  distance  d  down  a  frictionless  incline  at  angle  9  -  30.0°, 
where  it  runs  into  a  spring  of  spring  constant  450  N/m.  When  the  block  momentarily  stops,  it  has  com¬ 
pressed  the  spring  by  25.0  cm. 

(a)  Find  d. 

(b)  What  is  the  distance  between  the  first  block-spring  contact  and  the  point  at  which  the  block’s  speed 
is  greatest? 

3.10  Playground  slides  frequently  have  sections  of  varying  slope:  steeper  ones  to  pick  up  speed,  less  steep 
ones  to  loose  speed,  so  kids  (and  students)  arrive  at  the  bottom  safely.  We  consider  a  slide  with  two  steep 
sections  (angle  a )  and  two  less  steep  ones  (angle  p ).  Each  of  the  sections  has  a  width  L.  The  slide  has  a 
coefficient  of  kinetic  friction  p. 


L  L  L  L 


(a)  Kids  start  at  the  top  of  the  slide  with  velocity  zero.  Calculate  the  velocity  of  a  kid  of  mass  m  at  the  end 
of  the  first  steep  section. 

(b)  Now  calculate  the  velocity  of  the  kid  at  the  bottom  of  the  entire  slide. 

(c)  If  L  —  1.0  m,  a  -  30°  and  p  =  0.5,  find  the  minimum  value  p  must  have  so  that  kids  up  to  30  kg  can 
enjoy  the  slide  [Hint:  what  is  the  minimum  requirement  for  the  slide  to  be  functional)? 

(d)  A  given  slide  has  a  —  30°,  /)  =  20°  and  p  =  0.5.  A  young  child  of  10  kg  slides  down,  while  its  cousin  of 
20  kg  sits  at  the  bottom.  When  the  sliding  kid  reaches  the  end,  the  two  children  collide,  and  together 
slide  further  over  the  ground.  The  coefficient  of  kinetic  friction  with  the  ground  is  0.70.  How  far  do 
the  two  children  slide  before  they  come  to  a  full  stop? 


3.11  In  this  problem,  we  consider  the  anharmonic  potential  given  by 


Cl  n  i?  o 

U{x)--{x-xa)  +-[x-x0)  , 


where  a,  b  and  xq  are  positive  constants. 


(3.24) 


(a)  Find  the  dimensions  of  a,  b  and  xq. 

(b)  Determine  whether  the  force  on  a  particle  at  a  position  x  »  xq  is  attractive  or  repulsive  (taking  the 
origin  as  your  point  of  reference). 

(c)  Find  the  equilibrium  point(s)  (if  any)  of  this  potential,  and  determine  their  stability. 

(d)  For  b  =  0,  the  potential  given  in  equation  (3.2? )  becomes  harmonic  (i.e.,  the  potential  of  a  harmonic 
oscillator),  in  which  case  a  particle  that  is  initially  located  at  a  non-equilibrium  point  will  oscillate. 
Are  there  initial  values  for  x  for  which  a  particle  in  this  anharmonic  potential  will  oscillate?  If  so, 
find  them,  and  find  the  approximate  oscillation  frequency;  if  not,  explain  why  not.  (NB:  As  the  prob¬ 
lem  involves  a  third  order  polynomial  function,  you  may  find  yourself  having  to  solve  a  third  order 
problem.  When  that  happens,  for  your  answer  you  can  simply  say:  the  solution  x  to  the  problem  X). 


36 


3.  Energy 


3.12  After  you  have  successfully  finished  your  mechanics  course,  you  decide  to  launch  the  book  into  an  orbit 
around  the  Earth.  However,  the  teacher  is  not  convinced  that  you  do  not  need  it  anymore  and  asks  the 
following  question:  What  is  the  ratio  between  the  kinetic  energy  and  the  potential  energy  of  the  book  in 
its  orbit? 

Let  m  be  the  mass  of  the  book,  Me  and  f?e  the  mass  and  the  radius  of  the  Earth  respectively.  The  gravita¬ 
tional  pull  at  distance  r  from  the  center  is  given  by  Newton’s  law  of  gravitation  (equation  2.1 ): 


FJr)  =  -G 


mil. 


r 


(a)  Find  the  orbital  velocity  v  of  an  object  at  height  h  above  the  surface  of  the  Earth. 

(b)  Express  the  work  required  to  get  the  book  at  height  h. 

(c)  Calculate  the  ratio  between  the  kinetic  and  the  potential  energy  of  the  book  in  its  orbit. 

(d)  What  requires  more  work,  getting  the  book  to  the  International  Space  Station  (orbiting  at  h  -  400  km) 
or  giving  it  the  same  speed  as  the  ISS? 


3.13  Using  dimensional  arguments,  in  problem  .  we  found  the  scaling  relation  of  the  escape  velocity  (the 
minimal  initial  velocity  an  object  must  have  to  escape  the  gravitational  pull  of  the  planet/moon/other 
object  it’s  on  completely)  with  the  mass  of  the  radius  of  the  planet.  Here,  we’ll  re-derive  the  result,  in¬ 
cluding  the  numerical  factor  that  dimensional  arguments  cannot  give  us. 


(a)  Derive  the  expression  of  the  gravitational  potential  energy,  C7g,  of  an  object  of  mass  m  due  to  a  gravi¬ 
tational  force  Fg  given  by  Newton’s  law  of  gravitation  (equation  2.9): 


GmM  „ 
Fe  = - ^ — r. 


Set  the  value  of  the  integration  constant  by  Ug  — <•  0  as  r  — ►  oo. 

(b)  Find  the  escape  velocity  on  the  surface  of  a  planet  of  mass  M  and  radius  R  by  equating  the  initial 
kinetic  energy  of  your  object  (when  launched  from  the  surface  of  the  planet)  to  the  total  gravitational 
potential  energy  it  has  there. 


3.14  A  cannonball  is  fired  upwards  from  the  surface  of  the  Earth  with  just  enough  speed  such  that  it  reaches 
the  Moon.  Find  the  speed  of  the  cannonball  as  it  crashes  on  the  Moon’s  surface,  taking  the  gravity  of  both 
the  Earth  and  the  Moon  into  account.  Table  contains  the  necessary  astronomical  data. 


3.15  The  draw  force  F{x)  of  a  Turkish  bow  as  a  function  of  the  bowstring  displacement  x  (for  x  <  0)  is  approx¬ 
imately  given  by  a  quadrant  of  the  ellipse 


I  Fjx)  )2  |  I  x  +  dV  _  j 

1  ^Ynax )  V  x  ) 

In  rest,  the  bowstring  is  at  x  =  0;  when  pulled  all  the  way  back,  it’s  at  x-  -d. 

(a)  Calculate  the  work  done  by  the  bow  in  accelerating  an  arrow  of  mass  m  —  37  g,  for  d  =  0.85  m  and 
^max  —  360  N. 

(b)  Assuming  that  all  of  the  work  is  converted  to  kinetic  energy  of  the  arrow,  find  the  maximum  distance 
the  arrow  can  fly.  Hint :  which  variable  can  you  control  when  shooting?  Maximize  the  distance  with 
respect  to  that  variable. 

(c)  Compare  the  result  of  (b)  with  the  range  of  a  bow  that  acts  like  a  simple  (Hookean)  spring  with  the 
same  values  of  Fmax  and  d.  How  much  further  does  the  arrow  shot  from  the  Turkish  bow  fly  than  that 
of  the  simple  spring  bow? 


3.6.  Problems 
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3.16  A  massive  cylinder  with  mass  M  and  radius  R  is  connected  to  a  wall  by  a  spring  at  its  center  (see  figure). 
The  cylinder  can  roll  back-and-forth  without  slipping. 


(a)  Determine  the  total  energy  of  the  system  consisting  of  the  cylinder  and  the  spring. 

(b)  Differentiate  the  energy  of  problem  (  )  to  obtain  the  equation  of  motion  of  the  cylinder  and  spring 

system. 

(c)  Find  the  oscillation  frequency  of  the  cylinder  by  comparing  the  equation  of  motion  at  (  )  with  that 

of  a  simple  harmonic  oscillator  (a  mass-spring  system). 

3.17  A  small  particle  (blue  dot)  is  placed  atop  the  center  of  a  hemispherical  mount  of  ice  of  radius  R  (see 
figure).  It  slides  down  the  side  of  the  mount  with  negligible  initial  speed.  Assuming  no  friction  between 
the  ice  and  the  particle,  find  the  height  at  which  the  particle  loses  contact  with  the  ice. 


Hint:  To  solve  this  problem,  first  draw  a  free  body  diagram,  and  combine  what  you  know  of  energy  and 
forces. 

3.18  Pulling  membrane  tubes 

The  (potential)  energy  of  a  cylindrical  membrane  tube  of  length  L  and  radius  R  is  given  by 


(3.25) 


Here  k  is  the  membrane’s  bending  modulus  and  a  its  surface  tension. 

(a)  Find  the  dimensions  of  the  bending  modulus  and  the  surface  tension. 

(b)  Find  the  forces  acting  on  the  tube  along  its  radial  and  axial  direction. 

(c)  Membrane  tubes  are  often  pulled  by  membrane  motors  pulling  along  the  axial  direction,  as  sketched 


in  figure  3.5.  For  that  case,  we  add  the  work  done  by  the  motors  to  the  total  energy  of  the  tube,  so  we 
get: 


(3.26) 


Show  that  for  a  stable  tube,  the  motors  need  to  exert  a  force  of  magnitude  F  =  2jt\/2k(t. 

(d)  Can  the  force  of  (c)  be  considered  to  be  an  effective  spring  force?  If  so,  find  its  associated  spring 
constant.  If  not,  explain  why  not. 
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microtubule 


Figure  3.5:  Cartoon  of  molecular  motors  together  pulling  a  membrane  tube. 


Momentum 


4.1.  Center  of  mass 

4.1.1.  Center  of  mass  of  a  collection  of  particles 

So  far  we’ve  only  considered  two  cases  -  single  particles  on  which  a  force  is  acting  (like  a  mass  on  a  spring), 
and  pairs  of  particles  exerting  a  force  on  each  other  (like  gravity).  What  happens  if  more  particles  enter  the 
game?  Well,  then  we  have  to  calculate  the  total  force,  by  vector  addition,  and  total  energy,  by  regular  addition. 
Let’s  label  the  particles  with  a  number  a,  then  the  total  force  is  given  by: 

„  „  d2  (Y.nmara\  d2 

-ZF.  -E-li.  -  =  M-2rm.  (4.1) 


where  we’ve  defined  the  total  mass  M  —  m„  and  the  center  of  mass 

1  V 

Am  =  —  )  ma  ra . 

MV 


(4.2) 


4.1.2.  Center  of  mass  of  an  obiect 

Equation  (4.2)  gives  the  center  of  mass  of  a  discrete  set  of  particles.  Of  course,  in  the  end,  every  object  is 
built  out  of  a  discrete  set  of  particles,  its  molecules,  but  summing  them  all  is  going  to  be  a  lot  of  work.  Let’s 
try  to  do  better.  Consider  a  small  sub-unit  of  the  object  of  volume  dV  (much  smaller  than  the  object,  but 
much  bigger  than  a  molecule).  Then  the  mass  of  that  sub-unit  is  dm  =  pdV,  where  p  is  the  density  (mass 
per  unit  volume)  of  the  object.  Summation  over  all  these  masses  gives  us  the  center  of  mass  of  the  object,  by 
equation  (  ).  Now  taking  the  limit  that  the  volume  of  the  sub-units  goes  to  zero,  this  becomes  an  infinite 

sum  over  infinitesimal  volumes  -  an  integral.  So  for  the  center  of  mass  of  a  continuous  object  we  find: 

rcm=  f  p-rdV.  (4.3) 

Mjv 

Note  that  in  principle  we  do  not  even  need  to  assume  that  the  density  p  is  constant  -  if  it  depends  on  the 
position  in  space,  we  can  also  absorb  that  in  the  discussion  above,  and  end  up  with  the  same  equation,  but 
now  with  p(r).  That  will  make  the  integral  a  lot  harder  to  evaluate,  but  not  necessarily  impossible.  Also  note 
that  the  total  mass  M  of  the  object  is  simply  given  by  p  ■  V,  where  V  is  the  total  volume,  if  the  density  is 
constant,  and  by  fv  p(r)  dV  otherwise.  Therefore,  if  the  density  is  constant,  it  drops  out  of  equation  (  ),  and 
we  can  rewrite  it  as 

Am  —  —  /  rdV  for  constant  density  p.  (4.4) 

V  Jv 

Unfortunately,  many  textbooks  introduce  the  confusing  concept  of  a  infinitesimal  mass  element  dm,  instead 
of  a  volume  element  dV  with  mass  pdV.  This  strange  habit  often  throws  students  off,  and  the  concept  is 
wholly  unnecessary,  so  we  won’t  adapt  it  here. 

Equation  ( 1.3)  holds  for  any  continuous  object,  but  it  might  be  confusing  if  you  consider  a  linear  or  planar 
object  -  as  you  may  wonder  how  the  density  p  and  volume  element  dV  are  defined  in  one  and  two  dimen¬ 
sions.  There  are  two  ways  out.  One  is  to  say  that  all  physical  objects  are  three-dimensional  -  even  a  very 
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thin  stick  has  a  cross  section.  If  you  say  that  cross  section  has  area  A  (which  is  constant  along  the  stick,  or 
the  thin  stick  approximation  would  be  invalid),  and  the  coordinate  along  the  stick  is  x,  the  volume  element 
simply  becomes  d  V  =  Adx,  and  the  integral  in  equation  (  )  reduces  to  a  one-dimensional  integral.  You  can 

approach  two-dimensional  objects  in  the  same  way,  by  giving  them  a  small  thickness  A z  and  writing  the  vol¬ 
ume  element  as  d  V  =  AzdA  Alternatively,  you  can  define  one-  and  two-dimensional  analogs  of  the  density: 
the  mass  per  unit  length  A  and  mass  per  unit  area  cr,  respectively.  With  those,  the  one-  and  two-dimensional 
equivalents  of  equation  ( 1.3)  are  given  by 

Xcm  =  77  [  hxdx,  and  f-cm  =  f  p-rdA, 

M  Jo  MJa 

where  M  is  still  the  total  mass  of  the  object. 

4.1.3.  Worked  example:  center  of  mass  of  a  solid  hemisphere 


Figure  4.1:  Coordinate  system  for  the  calculation  of  the  center  of  mass  for  a  solid  hemisphere. 


By  symmetry,  the  center  of  mass  of  a  solid  sphere  must  lie  at  its  center.  The  center  of  mass  of  a  hemisphere 
cannot  be  guessed  so  easily,  so  we  must  calculate  it.  Of  course,  it  must  still  lie  on  the  axis  of  symmetry,  but  to 
calculate  where  on  that  axis,  we’ll  use  equation  .  To  carry  out  the  integral,  we’ll  make  use  of  the  symmetry 
the  system  still  has,  and  chop  our  hemisphere  up  into  thin  slices  of  equal  thickness  dz,  see  figure  .  The 
volume  of  such  a  slice  will  then  depend  on  its  position  z,  and  be  given  by  dV  -  nr(z)2  dz,  where  r(z)  is  the 
radius  at  height  z.  Putting  the  origin  at  the  bottom  of  the  hemisphere,  we  easily  obtain  r(z)  =  VR2-  z2, 
where  R  is  the  radius  of  the  hemisphere.  The  position  vector  r  in  equation  simply  becomes  (0, 0,  z),  so  we 
get: 

1  rR  99,  3 

Zcm  =  !^Jo  zn[R 


lz2iJ2_Iz4 
2  4 


3 

=  -R. 
8 


(4.5) 


The  center  of  mass  of  the  solid  hemisphere  thus  lies  at  rcm  =  (0, 0, 3RI8). 


4.2.  Conservation  of  momentum 

In  equation  (  ),  what  is  the  total  force  acting  on  all  the  particles?  Well,  that’s  the  sum  of  all  the  forces  the 

particles  exert  on  each  other,  plus  all  external  forces:  Tjotai  =  Hi  Tint,/  +  Hi  Text,/-  Now  Newton’s  third  law  of 
motion  tells  us  that  the  internal  forces  come  in  opposite  pairs,  so  when  we  sum  them,  they  all  cancel,  and 
the  total  net  force  acting  on  the  particles  is  equal  to  the  sum  of  the  external  forces  acting  on  the  particles. 
Therefore,  by  equation  (  ) ,  the  center  of  mass  of  a  system  of  particles  obeys  Newton’s  second  law  of  motion. 

What  about  the  momentum  of  the  center  of  mass?  Like  the  force,  the  total  momentum  of  the  system  of 
the  system  is  given  by  the  vector  sum  of  the  individual  particle  momenta: 

r— ,  d  ^  d 

Ttotal  =  2_.  Pa  ~  2-j  a  —  “77  tnar  a  —  ——Mr cm,  (4.6) 

a  a  d t  a  Qt 

so  the  total  momentum  of  the  system  equals  that  of  the  center  of  mass.  Moreover,  as  long  as  the  mass  of  the 
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system  is  conserved,  we  can  rewrite  equation  (  )  as 

d P total 

ftotal=— (4-7) 
df 

Not  only  does  the  center  of  mass  of  a  system  of  particles  obey  Newton’s  second  law  of  motion,  its  total  mo¬ 
mentum  does  too.  Moreover,  unlike  in  the  single-particle  case,  equation  (  )  has  an  important  consequence 

for  the  case  that  there  is  no  external  force  acting  on  the  system.  For  one  particle,  that  would  simply  mean  that 
the  momentum  does  not  change  -  Newton’s  first  law  of  motion.  But  for  multiple  particles,  equation  (  )  tells 

us  that  no  external  forces  means  that  the  total  momentum  does  not  change.  We  have  therefore  arrived  at  our 
second  conservation  law: 

Theorem  4.1  (Law  of  conservation  of  momentum).  When  no  external  forces  act  on  a  system  of  particles,  the 
total  momentum  of  the  system  is  conserved. 

We  derived  the  law  of  conservation  of  momentum  by  applying  both  Newton’s  second  and  third  laws  of 
motion,  so  like  conservation  of  energy,  it  is  not  an  independent  result,  but  follows  from  our  axioms.  Note  that 
the  law  allows  for  the  momenta  of  the  individual  particles  in  the  system  to  change,  as  long  as  their  total  stays 
the  same  -  this  is  what  happens  when  you  play  billiards,  and  why  the  number  of  balls  bouncing  in  a  Newton’s 
cradle  is  fixed. 

4.3.  Reference  frames 

4.3.1.  Center  of  mass  frame 

The  center  of  mass  need  not  be  fixed  in  space,  so  it  can  have  a  nonzero  velocity,  which  of  course  is  simply 
given  by  vcm  -  rcm.  For  each  of  the  particles  in  a  multi-particle  system,  we  can  decompose  its  velocity  by 
writing  it  as  the  sum  of  the  center  of  mass  velocity  and  a  velocity  relative  to  the  center  of  mass: 

Fa  —  Fan  +  fa,  rel-  (4.8) 

In  many  applications,  the  information  is  in  the  velocity  component  relative  to  the  center  of  mass.  After  all, 
conservation  of  momentum  implies  that  for  a  system  with  no  external  forces  acting  on  it,  the  center  of  mass 
velocity  cannot  change,  even  if  all  the  individual  momenta  do  change  (as  happens  in  collisions).  Therefore, 
it  is  often  convenient  to  analyze  your  system  in  a  frame  that  moves  with  the  center  of  mass,  known  (unsur¬ 
prisingly),  as  the  center  of  mass  frame.  In  this  frame,  the  center  of  mass  velocity  is  identically  zero,  and  again 
because  of  conservation  of  momentum,  all  other  velocities  in  this  frame  must  sum  to  zero.  The  ‘real-world’ 
frame  with  nonzero  center  of  mass  velocity  is  referred  to  as  the  lab  frame. 

4.3.2.  Galilean  transformations  and  inertial  frames 

As  equation  (  .8)  shows,  if  you  know  a  particle’s  velocity  in  the  center  of  mass  frame,  you  can  easily  calculate 
the  velocity  in  the  lab  frame  by  adding  the  velocity  of  the  center  of  mass.  Going  the  other  way,  to  calculate 
the  velocity  in  the  center  of  mass  frame,  you  subtract  i/cm  from  the  velocity  in  the  lab  frame.  Moreover,  if  the 
center  of  mass  moves  at  constant  velocity,  we  can  also  easily  relate  positions  in  both  frames.  If  we  denote 
coordinates  in  the  lab  frame  by  r  and  those  in  the  center  of  mass  frame  by  r' ,  we  readily  obtain: 

r  —  r'  +  vcmt,  (4.9) 

v=v'  +  vcm.  (4.10) 

Equation  (4.9)  is  an  example  of  a  Galilean  transformation  between  frames  of  reference  (here  the  lab  frame  and 
the  center  of  mass  frame).  It  actually  holds  for  any  pair  of  reference  frames  that  move  with  constant  velocity 
with  respect  to  each  other.  Such  frames  of  reference  are  known  as  inertial  frames  if  Newton’s  first  law  of 
motion  holds  in  them;  by  Newton’s  second  law,  if  one  of  the  frames  is  an  inertial  frame,  then  the  one  obtained 
from  it  by  a  Galilean  transformation  (i.e.,  one  moving  at  constant  velocity  with  respect  to  the  first  frame)  is 
also  an  inertial  frame.  The  reason  for  this  is  that  a  constant  velocity  plays  no  role  in  Newton’s  second  law,  as  it 
relates  the  derivative  of  the  velocity  (i.e.,  the  acceleration)  to  a  force.  Consequently,  not  only  is  Newton’s  first 
law  of  motion  valid  in  both  inertial  frames  -  all  laws  of  physics  are  the  same  in  two  such  frames.  This  fact  is 
known  as  the  principle  of  relativity.  It  does  not  apply  to,  for  example,  frames  that  rotate  with  respect  to  each 
other,  as  we’ll  see  in  chapter  7.  Moreover,  although  the  principle  of  relativity  is  universally  valid  (it  is  in  fact 
one  of  the  two  basic  assumptions  behind  Einstein’s  theory  of  relativity),  the  Galilean  transformations  are  not. 
They  break  down  at  velocities  that  approach  the  speed  of  light,  as  we’ll  explore  in  detail  in  part  . 
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4.3.3.  Kinetic  energy  of  a  collection  of  particles 

We’ve  established  above  that  both  the  total  momentum  and  energy  are  conserved  in  closed  systems,  but 
the  components  can  of  course  change.  Momentum  can  be  transferred  from  one  particle  to  another,  and  so 
can  (kinetic)  energy;  moreover  kinetic  energy  can  be  generated  from  potential  energy.  Unfortunately,  unlike 
momentum,  the  kinetic  energy  of  a  collection  of  particles  does  not  equal  that  of  the  center  of  mass  -  this  is 
because  kinetic  energy  depends  quadratically  rather  than  linearly  on  the  velocity.  The  total  kinetic  energy 
does  of  course  equal  the  sum  of  the  individual  particles’  kinetic  energies.  Moreover,  here  too  the  decomposi¬ 
tion  (4.8)  is  useful: 


K 


y  ma(Vcm  T  Va, rel) '  ( U:m  +  Va, rel) 
a  2 

E  \  m<*  "cm  +  E  "cm  •  fa, rel  +  E  \  m<*  "a, rel 

a  ^  a  a  ^ 

Krm  +  Kjnt 


(4.11) 

(4.12) 

(4.13) 


Because  the  center  of  mass  velocity  is  the  same  for  all  particles,  it  can  be  taken  out  of  the  sum  in  equa¬ 
tion  (  ).  Therefore,  the  first  term  equals  jMv^m  =  Kcm ,  and  in  the  second  term  we  end  up  with  the  sum 

over  all  velocities  relative  to  the  center  of  mass  -  which  is  zero.  We  find  that  the  total  kinetic  energy  of  a  collec¬ 
tion  of  particles  equals  the  kinetic  energy  of  the  center  of  mass  plus  the  total  internal  kinetic  energy  -  which 
can  change  in  both  collisions  and  when  potential  energy  gets  converted  into  kinetic  energy  (or  vice  versa). 


4.4.  Rocket  science* 

Although  designing  a  rocket  that  will  follow  a  desired  trajectory  (say  to  Ceres,  Pluto,  or  Planet  Nine)  with  great 
accuracy  is  an  enormous  engineering  challenge,  the  basic  principle  behind  rocket  propulsion  is  remarkably 
simple.  It  essentially  boils  down  to  conservation  of  momentum,  or,  equivalently,  the  observation  that  the 
velocity  of  center  of  mass  of  a  system  does  not  change  if  no  external  forces  are  acting  on  the  system.  To 
understand  how  a  rocket  works,  imagine  the  following  experiment:  you  sit  on  a  initially  stationary  cart  with 
a  large  amount  of  small  balls.  You  then  pick  up  the  balls  one  by  one,  and  throw  them  all  in  the  same  direction 
with  the  same  (preferably  high)  speed  (relative  to  yourself  and  thus  the  cart).  What  will  happen  is  that  you, 
the  cart,  and  the  remaining  balls  slowly  pick  up  speed,  in  the  opposite  direction  from  the  one  you’re  throwing 
the  balls  in.  This  is  exactly  what  a  rocket  engine  does:  it  thrusts  out  small  particles  (molecules,  actually)  at 
high  velocities,  gaining  a  small  velocity  itself  in  the  opposite  direction.  Note  that  this  is  completely  different 
from  most  other  engines,  which  drive  the  rotation  of  wheels  (that  depend  on  friction  to  work)  or  propellers 
(that  depend  on  drag  to  work). 

4.4.1.  lOCKET  EQUATION 

To  understand  what  happens  in  our  thought  experiment,  let’s  first  consider  the  first  ball  you  throw.  Let’s  call 
the  mass  of  yourself  plus  the  cart  M,  the  total  mass  of  the  balls  m,  and  the  (small)  mass  of  a  single  ball  dm.  If 
you  trow  the  ball  with  a  speed  u  (with  respect  to  yourself),  we  can  calculate  your  resulting  speed  in  two  ways: 

1 .  The  center  of  mass  must  remain  stationary.  Let’s  put  xcm  =  0.  Before  the  throw,  we  then  have  Xbau  d m  + 
XcariM  +  m)  -  0,  whereas  after  the  throw  we  have  -  lit  dm  +  ycarf(M+  m)  =  0,  or  ycar  =  -udml  {M  +  in'). 

2.  The  total  momentum  must  be  conserved.  Before  the  throw,  the  total  momentum  is  zero,  as  nothing  is 
moving.  After  the  throw,  we  get:  pbaii  +  Pear  =  -udm  +  vcal  (M  +  m).  Equating  this  to  zero  again  gives 
vcal  -  -udml  [M  +  m). 

Now  for  the  second,  third,  etc.  ball,  the  situation  gets  more  complicated,  as  the  car  (including  the  ball  that  is 
about  to  be  thrown)  is  already  moving.  Naturally,  the  center  of  mass  of  the  car  plus  all  the  balls  remains  fixed, 
as  does  the  total  momentum  of  the  car  plus  all  the  balls.  However,  to  calculate  how  much  extra  speed  the 
car  picks  up  from  the  nth  ball,  it  is  easier  to  not  consider  the  balls  already  thrown.  Instead,  we  consider  a  car 
(including  the  remaining  balls)  that  is  already  moving  at  speed  v,  and  thus  has  total  momentum  (M  +  m)  u. 
Throwing  the  next  ball  will  reduce  the  mass  of  the  car  plus  balls  by  dm,  and  increase  its  velocity  by  dv. 
Conservation  of  momentum  then  gives: 

(M  +  m)v  -  (M  +  m  -  d m)(v+  dv)  +  [v-  u)  dm  =  (M  +  m)v  +  (M  +  m)dv-  udm , 

1  Or  carry  out,  as  you  please. 


(4.14) 


4.4.  Rocket  science 
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Konstantin  Eduardovich  Tsiolkovsky  (1857-1935)  was  a  Russian  rocket 
scientist,  who  is  considered  to  be  one  of  the  pioneers  of  cosmonautics. 
Self-taught,  Tsiolkovsky  became  interested  in  spaceflight  both  through 
‘cosmic’  philosopher  Nikolai  Fyodorov  and  science-fiction  author  Jules 
Verne  and  considered  the  construction  of  a  space  elevator  inspired  by  the 
then  newly  built  Eiffel  tower  in  Paris.  Working  as  a  teacher,  he  spent  much 
of  his  free  time  on  research,  developing  the  rocket  equation  named  after 
him  (equation  .16)  as  well  as  developing  designs  for  rockets,  including 
multi-stage  ones.  Tsiolkovsky  also  worked  on  designing  airplanes  and  air¬ 
ships  (dirigibles),  but  did  not  get  support  from  the  authorities  to  develop 
these  further.  He  kept  working  on  rockets  though,  while  also  continuing 
as  a  mathematics  teacher.  Only  late  in  life  did  he  receive  recognition  for 
his  work  at  home  (then  the  Soviet  Union),  but  his  ideas  would  go  on  to  in¬ 
fluence  other  rocket  pioneers  in  both  the  Soviet  and  American  space  pro¬ 
grams. 


where  we  dropped  the  second-order  term  dmdv.  Equation  (  )  can  be  rewritten  to 

{M+m)dv-udm.  (4.15) 

Note  that  here  both  u  (the  speed  of  each  thrown  ball)  and  M  (the  mass  of  yourself  plus  the  car,  or  the  shell 
of  a  rocket)  are  constants,  whereas  m  changes,  ending  up  at  zero  when  you’ve  thrown  all  your  balls.  To  find 
the  velocity  of  our  car,  we  can  integrate  equation  (  ),  but  there  is  an  important,  and  rather  subtle,  point 

to  consider.  The  left-hand  side  of  equation  (  )  applies  to  the  car,  but  the  right-hand  side  to  the  thrown 

ball,  with  a  (positive)  mass  dm.  The  mass  m  of  the  balls  remaining  in  the  car,  however,  has  decreased  by  dm, 
so  if  we  wish  to  know  the  final  velocity  of  the  car,  we  need  to  include  a  minus  sign  on  the  right-hand  side  of 
equation  (  ).  Dividing  through  by  M  +  m  and  integrating,  we  then  obtain: 


Av-  Vf-  Vq  —  u log 


M+  mo| 
M  ]’ 


(4.16) 


where  Vf  is  the  final  velocity  of  the  car,  and  mo  the  initial  total  mass  of  all  the  balls.  Equation  (  )  is  known 

as  the  Tsiolkovsky  rocket  equation  . 


4.4.2.  JIT- STAGE  ROCKETS 

Because  of  the  logarithmic  factor  in  the  Tsiolkovsky  rocket  equation,  rockets  need  a  lot  of  fuel  compared 
to  the  mass  of  the  object  they  intend  to  deliver  (the  payload  -  say  a  probe,  or  a  capsule  with  astronauts). 
Even  so,  the  effectiveness  of  rockets  is  limited.  A  fuel  to  payload  ratio  of  9:1  (already  quite  high)  and  an 
initial  speed  of  zero  gives  a  final  speed  Vf  -  iilog(lO)  -  2.3  u,  and  increasing  the  ratio  to  99:1  only  doubles  this 
result:  Vf  —  wlog(lOO)  -  4.6 u.  To  get  around  these  limitations  and  give  rockets  (or  rather  their  payloads)  the 
speed  necessary  to  leave  Earth,  or  even  the  solar  system,  rockets  are  built  with  multiple  stages  -  essentially 
a  number  of  rockets  stacked  one  upon  the  next.  If  these  stages  all  have  the  same  fuel  to  payload  ratio  and 
exhaust  velocity,  the  final  velocity  of  the  payload  simply  is  that  of  a  single  stage  times  the  number  of  stages  n: 
Vf  -  nu log(l  +  mol M).  To  see  this,  consider  that  the  remaining  stages  are  the  payload  of  the  current  stage. 
Having  multiple  stages  thus  allows  rockets  to  pick  up  speed  more  efficiently,  essentially  by  shedding  a  part  of 
the  ‘payload’  (casing  of  an  empty  stage).  For  example,  the  Saturn  V  rocket  that  was  used  to  send  the  Apollo 
astronauts  to  the  moon  had  three  stages,  plus  a  small  rocket  engine  on  the  capsule  itself  (used  to  break  moon 
orbit  and  send  the  astronauts  back  to  Earth),  see  figure  4.3. 


4.4.3. 

When  you’re  crashing  into  something,  there  are  two  factors  that  determine  how  much  your  momentum 
changes:  the  amount  of  force  acting  on  you,  and  the  time  the  force  is  acting.  The  product  is  known  as  the 

2 Though  Tsiolkovsky  certainly  deserves  credit  for  his  pioneering  work,  and  he  likely  derived  the  equation  independently,  he  was  not  the 
first  to  do  so.  Both  the  British  mathematician  William  Moore  in  1813  and  the  Scottish  minister  and  mathematician  William  Leitch  in 
1861  preceded  him. 
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Figure  4.3:  Rockets  and  related  spacecraft  that  took  people  to  the  moon  in  the  late  1960’s  and  early  1970’s  [15].  (a)  Aerial  view  of  a  Saturn 
V  rocket  on  its  launchpad.  This  rocket  carries  the  Apollo  15,  the  fourth  mission  to  make  it  to  the  moon.  The  three  rocket  stages  are 
separated  by  rings  around  the  engine  of  the  next  stage.  The  total  height  of  the  rocket  at  launch  was  110.6  m;  it  had  a  total  mass  of  2.97 
million  kg,  and  could  take  a  payload  of  140000  kg  to  low-Earth  orbit  or  a  payload  of  48600  kg  to  the  moon,  (b)  View  from  the  launch  tower 
of  the  Saturn  V  carrying  Apollo  11  (the  famous  first  mission  to  the  moon  in  1969)  at  ignition.  The  little  rocket  on  top  was  to  be  used  for 
an  emergency  escape  of  the  manned  module  immediately  below  if  anything  went  wrong  at  launch.  The  manned  ‘command  module’  is 
the  little  conical  structure;  the  cylindrical  structure  directly  below  it  contained  its  engine,  and  the  conical  part  below  that  contained  the 
lunar  lander  (figure  d).  (c)  Jettisoned  third  stage  of  the  Saturn  V  rocket  that  carried  the  Apollo  17  mission  (the  sixth  and  last(!)  to  make  it 
to  the  moon  in  1972).  The  empty  space  at  the  front  contained  the  lunar  lander  module  at  launch,  (d)  Lunar  lander  of  the  1969  Apollo  11 
mission,  photographed  from  the  command  module  after  separation.  This  module  contained  two  rockets:  one  to  slow  the  descent  to  the 
moon  on  the  lower  part,  and  one  to  return  to  moon  orbit  with  just  the  upper  part.  The  lower  part  of  the  lander  remains  on  the  moon  and 
was  photographed  there  in  2012  by  the  Lunar  Reconnaisance  Orbiter,  an  unmanned  spacecraft  in  Moon  orbit. 


4.5.  Collisions 
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impulse,  which  by  Newton’s  second  law  equals  the  change  in  momentum: 

J=Ap  =  Jp(t)dt.  (4.17) 

The  specific  impulse,  defined  as  /sp  =  J I  mpTopenant,  or  the  impulse  per  unit  mass  of  fuel,  is  a  measure  of 
the  efficiency  of  jet  engines  and  rockets. 

4.5.  Collisions 

Collisions  occur  when  two  (or  more)  particles  hit  each  other.  During  a  collision,  those  particles  exert  forces 
on  each  other,  but  in  general,  there  are  no  external  forces  acting  on  the  system  consisting  of  the  colliding 
particles.  Consequently,  the  total  momentum  of  all  particles  involved  in  the  collision  is  conserved.  Typically, 
we  know  the  initial  velocities  and  the  masses  of  the  particles,  and  want  to  calculate  their  final  velocities  - 
though  of  course  you  could  also  do  it  the  other  way  around. 

Although  the  total  momentum  of  colliding  particles  is  conserved,  the  total  (kinetic)  energy  of  all  particles 
typically  is  not,  as  irreversible  processes  (such  as  plastic  deformations  of  the  particles)  occur  that  are  associ¬ 
ated  with  nonconservative  forces.  In  the  special  case  that  kinetic  energy  is  conserved  in  a  collision,  we  call 
the  collision  (totally)  elastic.  All  other  collisions  are  called  inelastic,  with  the  extreme  case  of  a  totally  inelastic 
collision,  in  which  the  colliding  objects  stick  together. 

4.6.  Totally  inelastic  collisions 

For  the  case  of  two  particles  colliding  totally  inelastically,  conservation  of  momentum  gives: 


m\  v\  +  m2V2  =  [m\  +  m2) Vf.  (4.18) 

If  the  masses  and  initial  velocities  of  the  particles  are  known,  calculating  the  final  velocity  of  the  composite 
particle  is  thus  straightforward. 

4.6.1.  Worked  example:  bike  crash 


1 


t 


Figure  4.4:  Two  colliding  cyclists. 


You’re  late  for  class  and  it’s  raining  to  boot,  so  you  cycle  as  fast  as  you  possibly  can,  without  paying  atten¬ 
tion  in  which  direction  you’re  going.  A  classmate,  similarly  late,  comes  towards  you  from  a  side  street  that 
makes  an  angle  <p  with  yours.  When  your  streets  cross,  you  crash  into  each  other,  moving  together  in  a  big 
clutch  of  people  and  bikes,  see  figure  .  Suppose  you’re  about  equally  heavy,  but  you’re  the  faster  biker, 
with  initially  twice  the  speed  of  your  classmate.  Find  the  velocity  (i.e.,  magnitude  and  direction)  you  and  your 
classmate  have  immediately  after  the  collision. 
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Solution 

Let’s  call  your  (and  your  classmate’s)  mass  m,  your  initial  speed  2v  (so  your  classmate’s  speed  is  v),  and  your 
combined  final  speed  Vf,  with  an  angle  9  with  your  initial  direction.  After  the  collision  you  move  as  one  object, 
so  the  collision  is  completely  inelastic.  During  the  collision  we  have  conservation  of  momentum  in  both  the 
x  and  y  directions,  which  gives: 


0+  v  sirup  =  tysind, 
2v-  vcostp  -  Vfcos 9. 


(4.19) 

(4.20) 


We  need  to  solve  for  both  Vf  and  9.  To  eliminate  9,  we  square  both  (  )  and  (4.2C)  and  add  them,  which 

gives 

=  f2sin  2(p+  i'2(2-cos  cp)2  -  v2{5  -  4cos(p),  (4.21) 

or  Vf  =  y-y/5  -  4cos <p.  To  get  9,  we  divide  (  )  by  (  ),  which  gives 

sirup 


tand  = 


2-  cos  (p 

We  can  easily  check  that  these  answers  make  sense  when  <p-  0,  which  gives  Vf  =  v  and  9  -  0,  as  it  should. 


(4.22) 


4.7.  Totally  elastic  collisions 

For  a  totally  elastic  collision,  we  can  invoke  both  conservation  of  momentum  and  (by  definition  of  a  totally 
elastic  collision)  of  kinetic  energy.  We  also  have  an  additional  variable,  as  compared  to  the  totally  inelas¬ 
tic  case,  because  in  this  case  the  objects  do  not  stick  together  and  thus  get  different  end  speeds.  The  two 
equations  governing  a  totally  elastic  collision  are: 


mi  vyi  +  m2  v2,i  =  m\  i/lif  +  m2  v2,u 


(4.23) 


for  momentum  conservation,  and 


\  ml  vl,i  +  \  m2  4i  =  \TU'  Vl.f  +  \  m2  vli 


(4.24) 


for  kinetic  energy  conservation. 

When  the  collision  occurs  in  one  dimension,  we  can  combine  equations  (  .23)  and  (  2‘ )  to  calculate  the 
final  velocities  as  functions  of  the  initial  ones.  We  first  rewrite  the  two  equations  so  that  everything  associated 
with  particle  1  is  on  the  left,  and  the  terms  for  particle  2  are  on  the  right: 


mi(vi,i-  vyf)  =  m2{v 2,f-  v2ii),  (4.25) 

and 

miiv\A-  vfj)  =  m2{vlx-  vli).  (4-26) 

We  can  expand  the  terms  in  parentheses  in  equation  (  26),  which  gives: 

iq,f)(i'i,i+  iq,f)  =  m2{v2y-  v2ii)[v2i{+  f2,i)-  (4.27) 

Dividing  equation  ( 1.27)  by  equation  ( 1.25),  we  get  a  relation  between  the  velocities  alone: 


Vl,i+  Viy  =  V2}i  +  V2y. 


(4.28) 


From  equation  (  )  we  can  isolate  v2y  and  substitute  back  in  (  )  to  find  i’\  j  in  terms  of  the  initial  veloci¬ 

ties: 


m  i  -  m2  m2 

L’l.f  =  - Vli  +  2 - V2y 

mi  +  m2  mi  +  m2 


(4.29) 


Naturally,  we  could  just  as  well  have  calculated  v2y,  the  equation  for  which  is  just  (  )  with  the  l’s  and  2’s 

swapped: 

mi  m2  -  m\ 

v2{  =  2 - vi,i  + - v2,\. 

mi  +  m2  mi  +  m2 


We  note  that  in  the  limit  case  that  m2  >>  mi,  v2  is  hardly  affected,  and  L’\j  -  -  v\  \  +2v2y. 


(4.30) 


4.8.  Elastic  collisions  in  the  COM  frame 
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4.8.  Elastic  collisions  in  the  COM  frame 

Equations  (  .2  )  and  (  )  give  the  final  velocities  of  two  particles  after  a  totally  elastic  collision.  We  did  the 

calculation  in  the  lab  frame,  i.e.,  from  the  point  of  view  of  a  stationary  observer.  We  could  of  course  just  as 
well  have  done  the  calculation  in  the  center-of-mass  (COM)  frame  of  section  .  Within  that  frame,  as  we’ll 
see  below,  the  relation  between  the  initial  and  final  velocities  in  an  elastic  collision  is  much  simpler  than  in 
the  lab  frame. 

We  will  use  equation  (  8)  to  calculate  velocities  in  the  COM  frame.  For  notational  simplicity,  we’ll  work 
in  one  dimension,  and  use  an  overbar  to  indicate  velocities  in  the  COM  frame,  so  we  get 


Vi  —  t’cm  +  O' 


for  each  particle  i.  The  velocity  of  the  center  of  mass  is  simply  the  time  derivative  of  its  position.  For  two 
particles,  it  is  given  by 


_  m  i  iq  +  m2  v2 

fcm  — 


(4.31) 


m  i  +  m2 


The  velocities  of  the  two  particles  in  the  COM  frame  is  then 


V\  =  V\~  L’cm  = 
V2  =  V2-  ton  = 


m  ?  m2 

- (iq  -  V2)  =  - Oel, 

m\  +  m2  mi  +  m2 

m\  nil 

- (v2-vi)  = - Oel. 

mi  +  m2  mi  +  m2 


(4.32) 

(4.33) 


where  vle\  =  iq  -  v2  —  iq  -  v2  is  the  relative  velocity  of  the  two  particles  .  Note  that  it  does  not  matter  whether 
we  calculate  the  relative  velocity  in  the  lab  or  COM  frame.  Equations  (  )  and  (  )  have  a  nice  symmetry 

in  their  velocity  components,  but  not  in  their  mass  components.  The  symmetry  is  more  complete  if  instead 
of  velocities,  we  consider  momenta  in  the  COM  frame,  where  p-mv: 


mi  m2 

Pi  -mivi  = - vlel  =  pvle\, 

mi  +  m2 

mi  m2 

p2  =  m2V2  = - Vle\  =  ~pvleh 

mi  +  m2 


(4.34) 

(4.35) 


where  we  introduced  a  new  variable  p,  the  reduced  mass 


mim2 
mi  +  m2 


(4.36) 


Clearly  the  total  momentum  in  the  center  of  mass  frame  is  zero  (as  it  should  be),  both  before  and  after  a 
collision,  and  is  thus  conserved.  To  find  out  what  happens  with  the  relative  velocity  in  an  elastic  collision,  we 
invoke  conservation  of  kinetic  energy,  which  we  calculate  using  K  -  \mv2  -  p2 12  m: 


Pji  {  Pji  _  Pjf  }  Pjf 

2mi  2m2  2mi  2m2 
2  -2  2  -2  2  -2  2  -2 

p  Vi  p  Vi  _  P  Vf  ,  P  Vf 

- 1 - _ - 1 - 

2mi  2m2  2mi  2m2 


(4.37) 

(4.38) 

(4.39) 


We  find  that  either  Dre\  \-  =  vrP\  \,  in  which  case  there  would  be  no  collision  (as  nothing  changes),  or  Frei,f  - 
-  V,i>  which  means  that  in  an  elastic  collision  in  the  COM  frame,  the  velocities  (and  momenta)  of  the  collid¬ 
ing  particles  reverse.  We  get: 


m2 

^i,f  =  —  ^i,i  = - (iq,i-  i;2,i) 

mi  +  m2 
mi 

v2,f  =-v2,i  = - ( v2yi  -  iq,i) 

m  i  +  m  2 


m2 

vrel  ,i> 

mi  +  m2 
nil 

t'rel  ,i- 

mi  +  m2 


(4.40) 

(4.41) 


We  can  of  course  transform  these  expressions  back  to  the  lab  frame  by  adding  the  center  of  mass  veloc¬ 
ity  (  .31),  which  gives  equations  (  29)  and  ( 1.30),  the  same  as  our  calculation  in  the  lab  frame. 


Equations  p  )  and  (  )  hold  in  multiple  dimensions  as  well. 

4Since  the  total  momentum  in  the  COM  frame  is  zero,  the  frame  is  sometimes  also  referred  to  as  the  zero-momentum  frame. 
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4.  Momentum 


4.9.  Problems 


4.1  Celestial  centers  of  mass  We  say  that  the  Moon  orbits  the  Earth,  because  the  Earth’s  gravity  pulls  on  the 
Moon,  causing  it  to  orbit.  However,  by  Newton’s  third  law,  the  Moon  exerts  a  force  back  on  the  Earth. 
Therefore,  the  Earth  should  move  in  response  to  the  Moon.  Thus  a  more  accurate  statement  would  be 
that  the  Moon  and  the  Earth  both  orbit  the  center  of  mass  of  the  Earth-Moon  system.  Useful  values: 
Me  =  5.97- 1024  kg,  RE  =  6.37- 106  m,  i?E,orbit  =  1.50- 1011  m,  Mm  -  7.35- 1022  kg,  RM  =  1.74- 106  m,  f?M, orbit  = 
3.84  •  108  m,  Ms  =  1.99  •  103°  kg,  Rs  =  6.96  •  108  m. 

(a)  Find  the  center  of  mass  of  the  Earth-Moon  system  (in  an  appropriately  chosen,  and  clearly  defined, 
coordinate  system) .  Does  this  center  of  mass  lie  inside  the  Earth? 

(b)  Find  the  location  of  the  center  of  mass  of  the  Earth-Moon-Sun  system  during  a  full  Moon. 

(c)  Find  the  location  of  the  center  of  mass  of  the  Earth-Moon-Sun  system  when  the  Moon  is  in  its  first 


quarter. 


4.2  A  shell  is  shot  with  an  initial  velocity  of  20  m/s,  at  an  angle  of  60°  with  the  horizontal.  At  the  top  of  the 
trajectory,  the  shell  explodes  into  two  fragments  of  equal  mass.  One  fragment,  whose  speed  immediately 
after  the  explosion  is  zero,  drops  to  the  ground  vertically.  How  far  from  the  gun  does  the  other  fragment 
land  (assuming  no  air  drag  and  level  terrain)? 

4.3  Two  cannonballs  with  masses  m\  and  m 2  are  simultaneously  fired  from  two  cannons  situated  a  distance  L 
apart. 

(a)  Find  the  equations  of  motion  for  the  horizontal  and  vertical  components  of  the  vector  describing  the 
center  of  mass  of  the  cannonballs. 

(b)  Show  that  the  motion  of  the  center  of  mass  is  a  parabola  through  space. 

4.4  Center  of  mass  of  some  solid  objects 


w 


(a)  Find  the  center  of  mass  of  an  isosceles  triangle  with  a  base  width  w  and  height  h  (see  figure  a). 


(b)  Find  the  center  of  mass  of  a  pentagon  with  five  equal  sides  of  length  a,  but  with  one  triangle  missing 
(see  figure  b).  Hint:  use  your  result  from  (a). 

(c)  Find  the  position  of  the  center  of  mass  of  a  semicylinder  (half  of  a  solid  cylinder,  i.e.,  a  solid  cylinder 
sliced  in  two  along  a  plane  containing  its  symmetry  axis).  Hint,  first  calculate  the  center  of  mass  of 
half  a  solid  disk. 

4.5  A  dog  (black  dot  in  the  sketch  below)  of  mass  m  stands  at  the  end  of  a  boat  of  mass  M  and  length  L  at 
an  initial  distance  D  from  the  shore.  The  dog  then  walks  to  the  other  end  of  the  boat  and  stops  there. 
Assuming  no  friction  between  the  boat  and  the  water,  how  far  is  the  dog  then  from  the  shore?  ( Hint : 
what  is  conserved?). 


4.9.  Problems 
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Figure  4.5:  Serena  Williams  serving  at  the  2008  Wimbledon  championships  [  ]. 

4.6  [For  optional  section  4]  Every  point  in  a  tennis  match  stars  with  one  of  the  players  serving.  The  most 
commonly  used  service  involves  the  player  tossing  the  ball  in  the  air  and  hitting  it  with  their  racket.  To 
get  the  ball  to  move  as  fast  as  possible,  players  commonly  swing  the  racket  to  give  it  a  large  momentum, 
and  deliver  a  maximal  impulse  to  the  ball. 

Figure  shows  Serena  Williams  serving  during  the  2008  Wimbledon  championships.  Williams  is  widely 
regarded  as  one  of  the  best  women  tennis  players  and  holds  the  record  of  most  aces  (scoring  a  point  from 
a  serve  without  the  opponent  touching  the  ball)  by  a  female  player  at  a  Grand  Slam  tournament. 

Williams  is  175  cm  tall.  As  you  can  see  in  the  figure,  at  the  top  of  its  trajectory,  the  ball  is  about  twice 
Williams’  height  above  the  ground.  Also,  as  the  span  of  people’s  arms  is  about  the  same  as  their  height, 
and  the  shoulders  of  an  adult  are  at  about  5 /6th  of  their  height,  we  can  estimate  Williams’  arms  to  be 
about  75  cm  long  and  her  shoulders  to  be  at  145  cm  above  the  ground.  The  distance  between  the  point 
where  a  player  holds  the  racket  and  where  they  hit  the  ball  is  typically  about  40  cm. 

(a)  Find  the  speed  of  the  ball  coming  down  at  the  moment  Williams  hits  it,  assuming  that  she  hits  it  with 
a  fully  upward  stretched  arm. 

(b)  Williams’s  personal  record  for  serve  speed  (speed  of  the  ball  after  it  was  hit  by  the  racket)  is  207  km/h. 
Determine  the  impulse  she  delivered  with  her  racket  on  the  58.0  g  tennis  ball  as  she  hit  it. 

(c)  Assuming  a  typical  racket  weight  of  360  g,  calculate  the  change  in  speed  of  the  racket  from  just  before 
to  just  after  Williams  hit  the  ball. 

(d)  Calculate  the  magnitude  of  the  force  on  Williams’  hand  at  the  moment  she  hits  the  ball  with  her 
racket. 

4.7  A  2.75  g  bullet  embeds  itself  in  a  1.50  kg  block,  which  is  attached  to  a  spring  of  force  constant  850  N/m. 
The  maximum  compression  of  the  spring  is  4.30  cm. 

(a)  Determine  the  initial  speed  of  the  bullet. 

(b)  Find  the  time  it  takes  the  bullet-block  system  to  come  to  rest. 

4.8  Head-on  collision  between  two  balls  A  ball  of  mass  m  has  velocity  v  when  it  makes  a  head-on  collision 
with  another  ball  of  mass  M  that  is  originally  at  rest.  After  the  collision  the  ball  of  mass  m  rebounds 
straight  back  along  its  path  with  2 /3rd  of  its  initial  kinetic  energy.  We  assume  that  the  collision  is  totally 
elastic. 

(a)  Sketch  the  situation  before  and  after  the  collision,  indicating  directions  of  velocity,  and  values  (if 
known,  give  symbols  otherwise). 

(b)  Write  down  all  applicable  conservation  laws  for  this  case. 
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4.  Momentum 


(c)  From  the  conservation  laws,  solve  for  the  mass  M  of  the  ball  that  is  initially  at  rest. 

(d)  Also  solve  for  the  velocity  of  that  ball  after  the  collision. 

4.9  A  small  ball  of  mass  m  is  aligned  above  a  larger  ball  of  mass  M  with  a  slight  separation,  and  the  two  are 
dropped  simultaneously  from  a  height  h.  Assume  the  radii  of  the  two  balls  and  the  initial  separation  are 
negligible  compared  to  h. 

(a)  If  the  larger  ball  rebounds  elastically  from  the  floor  and  then  the  small  ball  rebounds  elastically  from 
the  larger  ball,  what  value  of  m  (as  a  fraction  of  M)  results  in  the  larger  ball  stopping  when  it  collides 
with  the  small  ball? 

(b)  What  height  does  the  small  ball  then  reach? 


5 

Rotational  motion,  torque  and 

ANGULAR  MOMENTUM 


5.1.  Rotation  basics 

So  far,  we’ve  been  looking  at  motion  that  is  easily  described  in  Cartesian  coordinates,  often  moving  along 
straight  lines.  Such  kind  of  motion  happens  a  lot,  but  there  is  a  second  common  class  as  well:  rotational 
motion.  It  won’t  come  as  a  surprise  that  to  describe  rotational  motion,  polar  coordinates  (or  their  3D  coun¬ 
terparts  the  cylindrical  and  spherical  coordinates)  are  much  handier  than  Cartesian  ones  .  For  example,  if 
we  consider  the  case  of  a  disk  rotating  with  a  uniform  velocity  around  its  center,  the  easiest  way  to  do  so  is  to 
specify  over  how  many  degrees  (or  radians)  a  point  on  the  boundary  advances  per  second.  Compare  this  to 
linear  motion  -  that  is  specified  by  how  many  meters  you  advance  in  the  linear  direction  per  second,  which  is 
the  speed  (with  dimension  LI  T).  The  change  of  the  angle  per  second  gives  you  the  angular  speed  at,  where  a 
counterclockwise  rotation  is  taken  to  be  in  the  positive  direction.  The  angular  speed  has  dimension  1/T,  so 
it  is  a  frequency.  It  is  measured  in  degrees  per  second  or  radians  per  second.  If  the  angle  at  a  point  in  time  is 
denoted  by  0{t),  then  obviously  co-6,  just  like  v  —  x  in  linear  motion. 

In  three  dimensions,  oo  becomes  a  vector,  where  the  magnitude  is  still  the  rotational  speed,  and  the  direc¬ 
tion  gives  you  the  direction  of  the  rotation,  by  means  of  a  right-hand  rule:  rotation  is  in  the  plane  perpendic¬ 
ular  to  u>,  and  in  the  direction  the  fingers  of  your  right  hand  point  if  your  thumb  points  along  oo  (this  gives  oo 
in  the  positive  z  direction  for  rotational  motion  in  the  xy  plane). 

Going  back  to  2D  for  the  moment,  let’s  call  the  angular  position  6 It),  then 

d  0  ■ 

0)=  —  =  0.  (5.1) 

df 

If  we  want  to  know  the  position  r  in  Cartesian  coordinates,  we  can  simply  use  the  normal  conversion  from 
polar  to  Cartesian  coordinates,  and  write 

r(f)  =  rcos(cot)x  +  rsin(<yf)j>  =  rr,  (5.2) 

where  r  is  the  distance  to  the  origin.  Note  that  r  points  in  the  direction  of  the  polar  unit  vector  r.  Equa¬ 
tion  (  )  gives  us  an  interpretation  of  oo  as  a  frequency:  if  we  consider  an  object  undergoing  uniform  rotation 

(i.e.,  constant  radius  and  constant  velocity),  in  its  x  and  y-directions  it  oscillates  with  frequency  oo. 

As  long  as  our  motion  remains  purely  rotational,  the  radial  distance  r  does  not  change,  and  we  can  find 
the  linear  velocity  by  taking  the  time  derivative  of  (  ) : 

v{t)  —  r{t)  —  -oor  sin(dit)Jc  +  corcos(cot)y  -  oorO,  (5.3) 


so  in  particular  we  have  v  —  cor.  Note  that  both  v  and  oo  denote  instantaneous  speeds,  and  equation  (  ) 

only  holds  when  co  is  constant.  However,  the  relation  v  —  cor  always  holds.  To  see  that  this  is  true,  express  0  in 
radians,  0  -  sir,  where  s  is  the  distance  traveled  along  the  rotation  direction.  Then 


d0  1  ds  v 
df  r  df  r ' 

1  If  you  need  a  refresher  on  polar  coordinates,  or  are  unfamiliar  with  polar  basis  vectors,  check  out  appendix .  ..2. 


(5.4) 
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In  three  dimensions,  we  find 

v  -  oj  x  r,  (5.5) 

where  r  points  from  the  rotation  axis  to  the  rotating  point. 

Unlike  in  linear  motion,  in  rotational  motion  there  is  always  acceleration,  even  if  the  rotational  velocity  a> 
is  constant.  This  acceleration  originates  in  the  fact  that  the  direction  of  the  (linear)  velocity  always  changes  as 
points  revolve  around  the  center,  even  if  its  magnitude,  the  net  linear  speed,  is  constant.  In  that  special  case, 
taking  another  derivative  gives  us  the  linear  acceleration,  which  points  towards  the  center  of  rotation: 


a{t)  =  r(t)  =  -a)2r  cos{(jjt)x  +  <y2rsin(<t)f)  j>  =  -io2rr. 


(5.6) 


In  section  below  we  will  use  equation  (  )  in  combination  with  Newton’s  second  law 

late  the  net  centripetal  force  required  to  maintain  rotation  at  a  constant  rate. 

Of  course  the  angular  velocity  o>  need  not  be  constant  at  all.  If  it  is  not,  we  can  define 
ation  by  taking  its  time  derivative: 


of  motion  to  calcu- 
an  angular  acceler- 

(5.7) 


or  in  three  dimensions,  where  w  is  a  vector: 


(5.8) 


Note  that  when  a  is  parallel  to  to,  it  simply  represents  a  change  in  the  rotation  rate  (i.e.,  a  speeding  up/slowing 
down  of  the  rotation),  but  when  it  is  not,  it  also  represents  a  change  of  the  plane  of  rotation. 

In  both  two  and  three  dimensions,  a  change  in  rotation  rate  causes  the  linear  acceleration  to  have  a  com¬ 
ponent  in  the  tangential  direction  in  addition  to  the  radial  acceleration  (  ).  The  tangential  component  of 

the  acceleration  is  given  by  the  derivative  of  the  linear  velocity: 


Cl\  - 


dv  dw 


dt 


df 


ra. 


(5.9) 


In  two  dimensions,  at  points  along  the  ±8  direction. 

Naturally,  there  are  even  more  complicated  possibilities  -  the  radius  of  the  rotational  motion  can  change 
as  well.  We’ll  look  at  that  case  in  more  detail  in  chapter  6,  but  first  we  consider  ‘pure’  rotations,  where  the 
distance  to  the  rotation  axis  is  fixed. 


5.2.  Centripetal  force 

When  you’re  rotating  at  constant  angular  velocity,  the  magnitude  of  your  velocity  is  always  the  same,  but  its 
direction  constantly  changes  -  so  you’re  constantly  undergoing  an  acceleration,  as  indicated  in  equation  (5.6). 
Therefore  there  must  be  a  net  force  acting  on  you.  We  can  calculate  that  net  force  using  Newton’s  second  law 
of  motion.  It  is  known  as  the  centripetal  force  and  given  by: 


Fcp 


rat2,  n  x 

=  ma — - r  =  -mw  rr. 

r 


(5.10) 


‘Centripetal’  means  ‘center-seeking’  (from  Latin  ‘centrum’  =  center  and  ‘petere’  =  to  seek).  It  is  important 
to  remember  that  this  is  a  net  resulting  force,  not  a  ‘new’  force  like  that  exerted  by  gravity  or  a  compressed 
spring.  Equation  (5.10)  is  after  all  just  a  special  case  of  Newton’s  second  law  of  motion. 


5.3.  Torque 

Anyone  who  has  ever  used  a  lever  -  that  is  everyone,  presumably  -  knows  how  useful  they  are  at  augmenting 
force:  you  push  with  a  small  force  at  the  long  end,  to  produce  a  large  force  at  the  short  end,  and  make  the  crank 
turn,  stone  lift  or  bottle  cap  pop  off.  If  the  force  is  at  straight  angles  with  the  lever  arm  (the  line  connecting 
the  point  at  which  you  exert  the  force  to  the  pivot  around  which  your  lever  rotates),  the  effect  is  largest.  In 
that  case  we  define  the  torque  (or  moment  of  force)  t  as  the  product  of  the  force  and  the  lever  arm  length.  As 
only  the  perpendicular  component  of  the  force  counts  (you’ll  simply  push  or  pull  on  your  lever  with  a  parallel 
component,  not  make  it  turn) ,  in  a  vector  setting  you  need  to  project  on  that  perpendicular  component,  so  if 
r  (from  the  pivot  to  the  point  at  which  the  force  acts)  makes  an  angle  8  with  the  force  vector  F,  the  magnitude 
of  the  torque  becomes  t  =  rFsinO.  That’s  exactly  the  magnitude  of  the  cross  product  of  r  and  F,  which 
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Object 

Rotation  axis 

Moment  of  inertia 

Stick 

Stick 

Cylinder,  hollow 
Cylinder,  solid 

Sphere,  hollow 

Sphere,  solid 

Planar  object,  size  ax  b 

Planar  object,  size  ax  b 

Center,  perpendicular  to  stick 
End,  perpendicular  to  stick 
Center,  parallel  to  axis 

Center,  parallel  to  axis 

Any  axis  through  center 

Any  axis  through  center 

Axis  through  center,  in  plane, 
parallel  to  side  with  length  a 
Axis  through  center, 
perpendicular  to  plane 

|ml2 

MR2 

±MR2 

1 MR 2 
|  MR2 

J2  Mb2 

±M{a2  +  b2) 

Table  5.1:  Moments  of  inertia  for  some  common  objects,  all  with  total  mass  M  and  length  L  /  radius  R. 


also  has  directional  information  -  useful  as  a  torque  can  be  clockwise  or  counterclockwise.  In  general  we’ll 
therefore  define  the  torque  by: 

r  -  r  x  F,  (5.11) 

which  makes  a  counterclockwise  torque  positive,  in  correspondence  with  the  definition  of  the  rotation  vector 

(JL>. 


5.4.  Moment  of  inertia 

Suppose  we  have  a  mass  m  at  the  end  of  a  massless  stick  of  length  r,  rotating  around  the  other  end  of  the 
stick.  If  we  want  to  increase  the  rotation  rate,  we  need  to  apply  a  tangential  acceleration  at  =  ra,  for  which 
by  Newton’s  second  law  of  motion  we  need  a  force  F  =  mat  —  mra.  This  force  in  turn  generates  a  torque  of 
magnitude  t  =  r-F  —  mr2a.  The  last  equality  is  reminiscent  of  Newton’s  second  law  of  motion,  but  with  force 
replaced  by  torque,  acceleration  by  angular  acceleration,  and  mass  by  the  quantity  mr2.  In  analog  with  mass 
representing  the  inertia  of  a  body  undergoing  linear  acceleration,  we’ll  identify  this  quantity  as  the  inertia  of 
a  body  undergoing  rotational  acceleration,  which  we’ll  call  the  moment  of  inertia  and  denote  by  I: 

t  -  la.  (5.12) 

Equation  (  )  is  the  rotational  analog  of  Newton’s  second  law  of  motion.  By  extending  our  previous  exam¬ 
ple,  we  can  find  the  moment  of  inertia  of  an  arbitrary  collection  of  particles  of  masses  ma  and  distances  to 
the  rotation  axis  ra  (where  a  runs  over  all  particles),  and  write: 

a 

which  like  the  center  of  mass  in  section  easily  generalizes  to  continuous  objects  as 

I  =  f  (r  ■  r)pdV  =  [  pr2  dV.  (5.14) 

Jv  Jv 

Note  that  it  matters  where  we  choose  the  rotation  axis.  For  example,  the  moment  of  inertia  of  a  rod 
of  length  L  and  mass  m  around  an  axis  through  its  center  perpendicular  to  the  rod  is  T  m  l2 ,  whereas  the 
moment  of  inertia  around  an  axis  perpendicular  to  the  rod  but  located  at  one  of  its  ends  is  |  mL 2 .  Also, 
moments  of  inertia  are  different  for  hollow  and  solid  objects  -  a  hollow  sphere  of  mass  m  and  radius  R  has 
|  mR2  whereas  a  solid  sphere  has  |  mR2,  and  for  hollow  and  solid  cylinders  (or  hoops  and  disks)  around  the 
long  axis  through  the  center  we  find  mr2  and  \  mr2  respectively.  These  and  some  other  examples  are  listed 
in  table  .  Below  we’ll  relate  the  moment  of  inertia  to  the  kinetic  energy  of  a  moving-and-rolling  object,  but 
first  we  present  two  handy  theorems  that  will  help  in  calculating  them. 

Theorem  5.1  (Parallel  axis  theorem).  If  the  moment  of  inertia  of  a  rigid  body  about  an  axis  through  it’s  center 
of  mass  is  given  by  Icm,  then  the  moment  of  inertia  around  an  axis  parallel  to  the  original  axis  and  separated 
from  it  by  a  distance  d  is  given  by 

I=Icm  +  md2,  (5.15) 

2Like  the  one-  and  two-dimensional  analogs  of  the  center  of  mass  of  a  continuous  object  (4.3),  there  are  one-  and  two-dimensional 
analogs  of  (5.14),  which  you  get  by  replacing  p  with  A  or  a  and  dV  by  dx  or  d A,  respectively. 
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where  m  is  the  object’s  mass. 

Proof.  Choose  coordinates  such  that  the  center  of  mass  is  at  the  origin,  and  the  original  axis  coincides  with 
the  z  axis.  Denote  the  position  of  the  point  in  the  xy  plane  through  which  the  new  axis  passes  by  d,  and  the 
distance  from  that  point  for  any  other  point  in  space  by  rf; ,  such  that  r  =  d  +  .  Now  calculate  the  moment  of 

inertia  about  the  new  axis  through  d: 


I 


(5.16) 


Jv 


Here  d2  —  d  •  d.  The  last  integral  in  the  last  line  of  (  )  is  equal  to  the  position  of  the  center  of  mass,  which 


we  chose  to  be  at  the  origin,  so  the  last  term  vanishes,  and  we  arrive  at  (5.15). 


□ 


Note  that  the  first  two  lines  of  table  (moments  of  inertia  of  a  stick)  satisfy  the  perpendicular-axis  the¬ 
orem. 

Theorem  5.2  (Perpendicular  axis  theorem).  If  a  rigid  object  lies  entirely  in  a  plane,  and  the  moments  of  inertia 
around  two  perpendicular  axes  x  and  y  in  that  plane  are  Ix  and  Iy,  respectively,  then  the  moment  of  inertia 
around  the  axis  z  perpendicular  to  the  plane  and  passing  through  the  intersection  point,  is  given  by 


h  -  Ix+  Iy- 


(5.17) 


Proof.  We  simply  calculate  the  moment  of  inertia  around  the  z-axis  (where  A  is  the  area  of  the  object,  and  a 
the  mass  per  unit  area): 


(5.18) 


□ 


Note  that  the  last  two  lines  of  table  5.1  (moments  of  inertia  of  a  thin  planar  rectangle)  satisfy  the  parallel 
axis  theorem. 


5.5.  Kinetic  energy  of  rotation 


Naturally,  a  rotating  object  has  kinetic  energy  -  its  parts  are  moving  after  all  (even  if  they’re  just  rotating 
around  a  fixed  axis).  The  total  kinetic  energy  of  rotation  is  simply  the  sum  of  the  kinetic  energies  of  all  ro¬ 
tating  parts,  just  like  the  total  translational  kinetic  energy  was  the  sum  of  the  individual  kinetic  energies  of 
the  constituent  particles  in  section  .  Using  that  v  =  or,  we  can  write  for  a  discrete  collection  of  particles: 


(5.19) 


by  the  definition  (  13)  of  the  moment  of  inertia  I.  Analogously  we  find  for  a  continuous  object,  using  (  ): 


(5.20) 


so  we  arrive  at  the  general  rule: 


(5.21) 


Naturally,  the  work-energy  theorem  (eq.  )  still  holds,  so  we  can  use  it  to  calculate  the  work  necessary  to 
effect  a  change  in  rotational  velocity,  which  by  equation  (  )  can  also  be  expressed  in  terms  of  the  torque 

(in  2D): 


(5.22) 


5.6.  Angular  momentum 
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5.6.  Angular  momentum 

In  analogy  with  the  definition  of  torque,  r  =  r  x  F  as  the  rotational  counterpart  of  the  force,  we  define  the 
angular  momentum  L  as  the  rotational  counterpart  of  momentum: 

L-  r  x  p.  (5.23) 

For  a  rigid  body  rotating  around  an  axis  of  symmetry,  the  angular  momentum  is  given  by 

L  =  I(x>,  (5.24) 

where  I  is  the  moment  of  inertia  of  the  body  with  respect  to  the  symmetry  axis  around  which  it  rotates. 
Equation  (  )  also  holds  for  a  collection  of  particles  rotating  about  a  symmetry  axis  through  their  center  of 

mass,  as  readily  follows  from  (  .13)  and  (5.23).  However,  it  does  not  hold  in  general,  as  in  general,  L  does  not 
have  to  be  parallel  to  co.  For  the  general  case,  we  need  to  consider  a  moment  of  inertia  tensor  I  (represented 
as  a  3  x  3  matrix)  and  write  L  -  I- u>.  We’ll  consider  this  case  in  more  detail  in  section 


5.7.  Conservation  of  angular  momentum 


Given  that  the  torque  is  the  rotational  analog  of  the  force,  and  the  angular  momentum  is  that  of  the  linear 
momentum,  it  will  not  come  as  a  surprise  that  Newton’s  second  law  of  motion  has  a  rotational  counterpart 
that  relates  the  net  torque  to  the  time  derivative  of  the  angular  momentum.  To  see  that  this  is  true,  we  simply 
calculate  that  time  derivative: 


d  L 
dt 


dr 


dp 


—  x  p+ r x  — 
df  dt 


■  T, 


(5.25) 


because  r  *  p  =  v  *  mv  =  0.  Some  texts  even  use  equation  (  )  as  the  definition  of  torque  and  work  from 

there.  Note  that  in  the  case  that  there  is  no  external  torque,  we  arrive  at  another  conservation  law: 


Theorem  5.3  (Law  of  conservation  of  angular  momentum) .  When  no  external  torques  act  on  a  rotating  object, 
its  angular  momentum  is  conserved. 

Conservation  of  angular  momentum  is  why  a  rolling  hoop  keeps  rolling,  and  why  a  balancing  a  bicycle  is 
relatively  easy  once  you  go  fast  enough. 

What  about  collections  of  particles?  Here  things  are  a  little  more  subtle.  Writing  L  =  £  ,•  L,-  and  again  taking 
the  derivative,  we  arrive  at 

^=Irixfi  =  £  Ti.  (5.26) 

^  i  i 

Now  the  sum  on  the  right  hand  side  of  (  ,26)  includes  both  external  torques  exerted  on  the  system,  and  in¬ 
ternal  torques  exerted  by  the  particles  on  each  other.  When  we  discussed  conservation  of  linear  momentum, 
the  internal  momenta  all  canceled  pairwise  because  of  Newton’s  third  law  of  motion.  For  torques  this  is 
not  necessarily  true,  and  we  need  the  additional  condition  that  the  internal  forces  between  two  particles  act 
along  the  line  connecting  those  particles  -  then  the  internal  torques  are  zero,  and  equation  ( 5.25)  holds  for  the 
collection  as  well.  Consequently,  if  the  net  external  torque  is  zero,  angular  momentum  is  again  conserved. 

5.8.  Rolling  and  slipping  motion 

When  you  slide  an  object  over  a  surface  (say,  a  book  over  a  table),  it  will  typically  slow  down  quickly,  due  to 
frictional  forces.  When  you  do  the  same  with  a  round  object,  like  a  water  bottle,  it  may  initially  slide  a  little 
(especially  if  you  push  it  hard),  but  will  quickly  start  to  rotate.  You  can  easily  check  that  when  rotating,  the 
object  looses  much  less  kinetic  energy  to  work  than  when  sliding  -  take  the  same  water  bottle,  either  on  its 
bottom  (sliding  only)  or  on  its  side  (a  little  sliding  plus  rolling),  push  it  with  the  same  initial  force,  and  let  go: 
the  rolling  bottle  gets  much  further.  However,  somewhat  ironically,  the  bottle  can  only  roll  thanks  to  friction. 
To  start  rolling,  it  needs  to  change  its  angular  momentum,  which  requires  a  torque,  which  is  provided  by  the 
frictional  force  acting  on  the  bottle. 

When  a  bottle  (or  ball,  or  any  round  object)  rolls,  the  instantaneous  speed  of  the  point  touching  the  sur¬ 
face  over  which  it  rolls  is  zero.  Consequently,  its  rotational  speed  to  and  the  translational  speed  of  its  center  of 
rotation  vT  (where  the  r  subscript  is  to  indicate  rolling)  are  related  by  vr  =  toR,  with  R  the  relevant  radius  of  our 
object.  If  the  object’s  center  of  rotation  moves  faster  than  vr,  the  rotation  can’t  ‘keep  up’,  and  the  object  slides 
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before 

follow  shot 

after 

e 

before 

draw  shot 

after 

O) 

0) 

CO 

0) 

Figure  5.1:  Five  types  of  billiard  shots,  (a-c)  The  type  of  motion  depends  on  where  the  cue  hits  the  ball,  (a)  If  the  cue  hits  the  ball  at 
exactly  7R/5  above  the  table,  the  ball  will  exhibit  pure  rolling  motion,  co  =  vR.  (b)  If  the  cue  hits  the  ball  above  the  critical  spot,  it  will 
rotate  faster  than  translate  (a)  >  vR)  and  exhibit  a  slipping  rotation.  Friction  will  slow  down  the  rotation  until  rolling  motion  is  achieved. 

(c)  If  the  cue  hits  the  ball  below  the  critical  spot,  it  will  translate  faster  than  rotate  {io  <  vR)  and  initially  slide,  until  friction  both  slows 
down  the  translational  speed  and  accelerate  the  rotational  speed  to  the  point  where  rolling  motion  is  achieved.  Note  that  the  rotational 
motion  may  even  be  retrograde,  i.e.,  backwards  compared  to  the  translational  motion,  (d-e)  Behavior  of  the  incident  billiard  ball  before 
and  after  collision  with  a  stationary  ball  of  equal  mass.  Since  the  collision  is  elastic,  all  linear  momentum  is  transferred  to  the  other  ball. 
If  the  incident  ball  was  initially  rolling,  immediately  after  the  collision  it  will  continue  rotating  with  complete  slipping.  Friction  then 
causes  the  ball  to  pick  up  linear  speed  again,  with  a  direction  depending  on  the  direction  of  the  rotational  motion,  resulting  in  a  follow 

(d)  or  draw  (e)  shot. 


5.8.  Rolling  and  slipping  motion 
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over  the  surface.  We  call  this  type  of  motion  slipping.  Due  to  friction,  objects  undergoing  slipping  motion 
typically  quickly  slow  down  to  vr,  at  which  point  they  roll  without  slipping. 

Suppose  we  started  our  object  with  a  velocity  vq.  If  there  is  no  rotation,  the  only  force  changing  its  veloc¬ 
ity  is  the  constant  frictional  force  /"irict  ion  =  Pk^N  =  PvJng,  with  m  the  mass  of  the  object  (equation  ).  The 
constant  force  results  in  a  linear  decrease  in  the  translational  speed  (see  section  2.3):  v{t)  -  vq  -  p^gt.  How¬ 
ever,  if  our  object  can  roll,  there  is  a  second  contribution  to  the  motion,  due  to  the  torque  TfrjCtjon  =  -Ffnction^ 
of  the  frictional  force.  Using  the  rotational  analog  of  Newton’s  second  law,  equation  (  )  (or  writing  L-  Id) 

and  using  equation  (  )),  we  get  an  equation  of  motion  for  the  rotational  velocity: 


dw 

l a  —  i  ,  —  T|riClion  —  ^friction  9. 

at 


(5.27) 


Integrating  equation  (  )  with  initial  condition  d)[t  -  0)  =  0  we  get  co(t)  =  p^mgRtl  I.  While  the  object 

undergoes  slipping  motion,  the  translational  speed  thus  linearly  decreases  with  time,  whereas  the  rotational 
speed  linearly  increases.  To  find  the  time  and  velocity  at  which  the  object  enters  a  purely  rolling  motion,  we 
simply  equate  v{t )  with  d>(t)R,  which  gives 


pkg{l  +  mR2II)’ 


1  1  +  I/(mR2) 

Note  that  the  time  fr  until  fully  rolling  motion  is  achieved  scales  inversely  with  the  friction  coefficient,  but  the 
final  rolling  speed  vr  is  independent  of  the  frictional  force.  The  rolling  speed  does  depend  on  the  moment  of 
inertia  of  your  object  -  for  a  hollow  cylinder  it’s  vt  =  ^  Vq,  whereas  for  a  solid  cylinder  it’s  vT  =  |  Vq.  Once  the 
object  is  rolling,  its  surface  no  longer  moves  with  respect  to  the  surface  that  it’s  rolling  on  (as  its  instantaneous 
speed  at  the  point  of  touching  is  zero).  Consequently,  the  frictional  force  is  much  reduced,  and  the  object  can 
roll  a  large  distance  before  it  stops;  in  fact,  the  main  force  slowing  it  down  once  it  is  rolling  is  drag  with  the 
ambient  air,  which  we  could  safely  ignore  when  (kinetic)  friction  was  still  in  the  picture. 

5.8.1.  Worked  example:  A  cylinder  rolling  down  a  slope 

A  massive  cylinder  with  mass  m  and  radius  R  rolls  without  slipping  down  a  plane  inclined  at  an  angle  9. 
The  coefficient  of  (static)  friction  between  the  cylinder  and  the  plane  is  p.  Find  the  linear  acceleration  of  the 
cylinder. 


(5.28) 

(5.29) 


Figure  5.2:  Free  body  diagram  of  a  cylinder  rolling  down  a  plane. 


Solution 

There  are  at  least  three  ways  to  tackle  this  problem.  For  all  three,  it  helps  (as  always)  to  make  a  sketch,  indi¬ 
cating  the  relevant  forces  -  see  figure  5.2. 
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Method  1:  Forces  and  torques.  Let  the  friction  force  F\  be  positive  in  the  direction  up  the  plane.  Then 
we  have: 


F  —  ma  =>  mgsin6-F{-ma 

1 


F{R  =  -  mR  a 
2 


t  -  la 

no  slipping  =>  a  —  aR. 

The  last  two  equations  give  Ff  =  |  ma.  Plugging  this  into  the  first  equation  gives 

gsind  2 


l+i  3 


gsind. 


Method  2:  Energy.  The  total  energy  of  the  system  is  given  by 

1  n  1  o 

Ftot  =  K+V  —  -  m  v  +  -  la)  +  mgh. 


If  the  cylinder  rolls  down  the  slope  without  slipping,  its  angular  and  linear  velocities  are  related  through 
v  =  coR.  Also,  if  it  moves  a  distance  Ax,  its  height  decreases  by  Ax  •  sin  (7.  Conservation  of  energy  then 
gives: 


0  = 


riFtot 
d  t 


d 
d  t 


1  ?1(V\2 

-mv  +  —  j  -mgxsinO 


=  mvv+  I  — —  -  mgvsinO 
R2 

a+  ~a~  g sind 

where  we  used  I  -  \  mR2  for  a  massive  cylinder  in  the  last  line.  The  linear  acceleration  a  is  thus  given 
by  a  -  |gsind. 

Method  3:  Rotational  version  of  Newton’s  second  law.  At  a  given  point  in  time,  we  can  apply  the  ro¬ 
tational  version  of  Newton’s  second  law  to  rotations  about  the  point  where  the  cylinder  touches  the 
surface  (as  the  cylinder  is  rolling  without  slipping,  this  is  the  only  motion  at  that  point).  Of  the  three 
forces  in  the  system,  two  act  at  that  point,  so  they  have  no  lever  arm.  Only  gravity  has  a  nonzero  lever 
arm  of  length  R  sin  9,  leading  to  a  torque  given  by  tz  =  mg  R  sin  0.  By  the  rotational  version  of  Newton’s 
second  law,  we  have  t  =  la,  where  I  is  the  moment  of  inertia  about  the  pivot.  Applying  the  parallel-axis 
theorem,  we  find  I  =  Icm  +  md2  =  |  mR2  in  this  case,  so  we  get  an  angular  acceleration  of 


tz  mgRsind  2 g  . 

a  -  —  =  — 5 - =  —  stntL 

I  I mR 2  3 R 


The  linear  acceleration  of  the  center  of  the  cylinder  due  to  the  ‘rotation’  about  this  pivot  is  given  by 
a  —  Ra  —  |gsin0. 


5.9.  Precession  and  nutation 

The  action  of  a  torque  causes  a  change  in  angular  momentum,  as  expressed  by  equation  ).  A  special  case 
arises  when  the  torque  is  perpendicular  to  the  angular  momentum:  in  that  case  the  change  affects  only  the 
direction  of  the  angular  momentum  vector,  not  its  magnitude.  Since  the  torque  is  given  by  the  cross  product 
of  the  arm  and  the  force,  this  case  arises  when  the  angular  momentum  is  parallel  to  either  arm  or  force,  or 
more  generally,  lies  in  the  plane  spanned  by  the  force  and  arm.  As  a  result,  the  angular  momentum  vector 
may  start  rotating  about  a  fixed  axis,  a  process  known  as  precession.  Due  to  the  action  of  a  second  force  (with 
associated  torque),  the  angle  between  the  angular  momentum  vector  and  the  fixed  axis  (which  we’ll  call  the 
z-axis)  may  also  change,  a  process  known  as  nutation. 

The  simplest  example  of  a  precessing  system  is  that  of  a  rotationally  symmetric  top,  spinning  about  an 
axis  that  is  not  the  vertical  z-axis,  see  figure  5.3.  In  this  case,  the  arm  of  gravitational  force  (pointing  from  the 
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Figure  5.3:  Precession  of  a  spinning  top.  The  top  rotates  about  an  axis  which  makes  an  angle  ip  with  the  vertical  (z)  axis.  The  angular 
momentum  of  the  top  is  parallel  to  the  rotation  vector.  The  arm  between  the  pivot  (where  the  top  touches  the  supporting  surface)  and 
the  center  of  mass  is  parallel  to  the  angular  momentum  as  well.  Consequently,  the  torque  of  the  gravitational  force  (which  as  always  is 
pointed  downward  from  the  center  of  mass)  is  perpendicular  to  the  angular  momentum,  and  causes  it  to  precess  about  the  z-axis  with  a 
precession  frequency  oip. 


pivot  at  the  origin  to  the  center  of  mass  of  the  top)  is  parallel  to  the  angular  momentum,  which  itself  is  parallel 
to  the  rotation  vector,  as  L  -  Ia>.  The  torque  of  gravity  is  thus  perpendicular  to  the  angular  momentum. 
If  we  call  the  angle  between  L  and  the  z-axis  (p ,  and  work  in  cylindrical  coordinates  (p,0,z),  we  can  write 
L  =  Lxyp  +  Lzz,  where  Lxy  —  Lsiiup  is  the  projection  of  L  on  the  xy-plane,  and  p  is  the  radial  unit  vector  in 
the  xy-plane  (i.e.,  p  =  cos 0 Jr  +  sindy).  The  gravitational  torque  is  then  given  by: 

tz  =  r  x  Fz  -  mgr  sin <p§,  (5.30) 

where  r  is  the  arm  pointing  from  the  origin  to  the  center  of  mass,  r  its  length,  and  0  the  angular  unit  vector 
in  the  xy-plane  (i.e.,  0  —  -  sin  Ox  +  cosOy).  For  the  time  derivative  of  L,  we  get: 


dL 

df 


d  Lxy 

df 


■  A  dLz 

P  +  LXydO  +  ^  Z, 


where  we  used  (see  equation  ) 


dp  dp  d 6 
df  d0  df 


Equating  (  )  and  (  ),  we  find  that  dLXy/df  =  dLz/df  =  0,  and 

_  d0  mgr  sin <p  mgr 

p  df  LXy  1(1) 


(5.31) 


(5.32) 


(5.33) 


Equation  (  )  gives  us  the  frequency  of  the  precession  about  the  z-axis.  The  associated  precession  rotation 

vector  is  given  by  wp  =  <bpz. 

Where  precession,  in  terms  of  the  angles  used  in  this  section,  represents  a  change  in  9,  nutation  is  asso¬ 
ciated  with  a  change  in  the  tilt  angle  <p.  It  comes  about  in  several  cases,  most  commonly  due  to  the  action  of 
a  second  force,  and  often  results  in  a  periodic  motion  (hence  the  name  nutation,  ‘nodding’).  One  example  is 
the  change  in  the  Earth’s  axis.  As  the  axis  is  not  perpendicular  to  the  plane  of  Earth’s  orbit  (it  presently  makes 
an  angle  of  about  23.4°),  gravity  from  the  sun  exerts  a  net  torque  on  the  Earth’s  rotation  axis,  causing  it  to 
precess  with  a  period  of  about  25000  years.  Consequently,  the  current  polar  star  will  only  remain  the  polar 
star  for  a  couple  of  centuries.  Due  to  the  torques  exerted  by  the  moon  and  the  other  planets,  the  Earth’s  axis 
also  nutates  -  with  amplitudes  from  arcseconds  to  a  few  degrees,  and  periods  that  range  widely,  from  18.6 
years  due  to  the  gravitational  pull  of  the  moon  up  to  millennia  due  to  other  effects. 
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5.10.  Problems 

5.1  Figure  5.4  shows  two  hand  powered  drills. 

(a)  With  which  of  the  two  drills  shown  in  figure  will  you  be  able  to  exert  a  greater  torque  on  the  drill 
bit? 

(b)  With  which  of  the  two  drills  shown  in  figure  5.4  will  you  be  able  to  exert  a  greater  rotational  speed? 


Figure  5.4:  Two  hand-powered  drills,  (a)  A  brace  [16],  CC  BY-SA  3.0.  (b)  An  egg-beater  drill  [17]. 


5.2  An  Atwood’s  machine  consists  of  two  masses  m\  and  m2,  connected  by  a  string  that  passes  over  a  pulley. 
If  the  pulley  is  a  disk  of  radius  R  and  mass  M,  find  the  acceleration  of  the  masses. 

5.3  You  hold  a  uniform  pen  with  a  mass  of  25.0  g  horizontal  with  your  thumb  pushing  down  on  one  end  and 
your  index  finger  pushing  upward  2.0  cm  from  your  thumb.  The  pen  is  14  cm  long. 

(a)  Which  of  your  fingers  exerts  the  greater  force? 

(b)  Find  the  two  forces. 

5.4  A  wagon  wheel  is  constructed  as  shown  in  the  figure  below.  The  radius  of  the  wheel  is  R.  Each  of  the 
spokes  that  lie  along  the  diameter  has  a  mass  m,  and  the  rim  has  mass  M  (you  may  assume  the  thickness 
of  the  rim  and  spokes  are  negligible  compared  to  the  radius  R). 


(a)  What  is  the  moment  of  inertia  of  the  wheel  about  an  axis  through  the  center,  perpendicular  to  the 
plane  of  the  wheel  (figure  a)? 

(b)  For  the  same  wheel,  what  is  the  moment  of  inertia  for  an  axis  through  the  center  and  two  of  the 
spokes,  in  the  plane  of  the  wheel  (figure  b)? 

5.5  A  sphere  with  radius  R  -  0.200  m  has  a  density  p  that  decreases  with  distance  r  from  the  center  of  the 
sphere,  according  to  p{r)-  a-  br,  where  a  —  1.00- 103  kg/m3  and  b  —  4.00  •  103  kg/m4. 

(a)  Calculate  the  total  mass  of  the  sphere. 

(b)  Calculate  the  moment  of  inertia  of  the  sphere  for  an  axis  through  its  center. 

5.6  A  0.10  kg  yo-yo  has  an  outer  radius  R  that  is  5.0  times  greater  than  the  radius  r  of  its  axle.  The  yo-yo  is 
in  equilibrium  if  a  mass  m  is  suspended  from  its  outer  edge,  as  shown  in  figure  5.5.  Find  the  tensions  T\ 
and  T2  in  the  two  strings,  and  the  mass  m. 

5.7  A  table  consists  of  three  pieces:  a  tabletop  which  is  a  circular  disk  of  radius  R,  thickness  d  —  A’/ 50,  and 
mass  M;  a  single  leg  that  supports  the  tabletop  in  its  center  and  consists  of  a  hollow  cylinder  of  height 
h  -  Rl 2  and  mass  m  =  Ml 5,  and  a  foot,  which  consists  of  a  solid  cylinder  of  radius  Rl 3,  mass  M  and 
height  h  =  Rl 4. 
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Figure  5.5:  A  yo-yo  with  a  mass  suspended  from  its  outer  edge. 


(a)  Find  the  position  of  the  center  of  mass  of  this  table. 

(b)  With  what  force  should  you  push  down  on  the  edge  of  the  table  to  make  it  tip  over? 

(c)  A  stone  of  mass  m  =  M/2  is  placed  on  the  table.  How  far  out  from  the  center  can  it  be  positioned 
before  the  table  tips  over?  You  may  approximate  the  stone  as  a  point  mass. 

5.8  A  set-square  protractor  made  of  plastic  with  density  p  —  1.2  g/cm3,  thickness  2.0  mm  and  sides  of  10  cm 
is  placed  in  a  coordinate  system  as  shown  in  the  figure.  The  z-axis  (not  shown)  is  coming  out  of  the  paper. 

a  b 


(a)  Find  the  position  of  the  center  of  mass  of  the  protractor. 

(b)  Find  the  moment  of  inertia  of  the  protractor  when  rotating  it  along  the  y-axis. 

(c)  Find  the  moment  of  inertia  of  the  protractor  when  rotating  it  along  the  z-axis  (i.e.,  the  axis  through 
the  origin  and  perpendicular  to  the  xy-plane). 

(d)  Find  the  moment  of  inertia  of  the  protractor  when  rotating  it  along  an  axis  through  its  center  of  mass, 
and  parallel  to  the  z-axis  (i.e.,  perpendicular  to  the  xy-plane). 

(e)  You  pick  up  the  protractor  and  balance  it  on  your  finger  in  its  center  of  mass.  You  then  tap  one  of  the 
points  with  a  small  force  F,  directed  in  the  plane  of  the  protractor.  For  which  of  the  three  forces  shown 
in  figure  b  is  the  rotational  speed  the  protractor  gets  the  largest?  (As  always,  explain  your  answer;  the 
magnitude  of  all  three  forces  is  the  same,  the  three  indicated  angles  are  all  45°). 
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(f)  You  bring  in  three  friends,  who  now  push  on  the  protractor  along  the  directions  indicated  by  fq,  F2 
and  F3  in  the  figure.  If  F\  and  P2  are  both  1.0  N,  how  large  must  F2l  be  so  that  the  protractor  does  not 
move? 

5.9  In  this  problem,  we  consider  an  object  which  consists  of  a  frame  with  four  balls,  as  depicted  in  the  figure 
below.  Each  of  the  balls  has  a  radius  R  —  5.0  cm.  Their  masses  are  m,\  =  3.0  kg,  ms  =  3.0  kg,  me  =  2.0  kg 
and  mo  -  1.0  kg.  The  rods  of  the  frame  (which  run  from  the  edge  of  one  ball  to  the  edge  of  the  next)  have 
lengths  L\  -  20  cm  and  L2  =  30  cm  and  a  linear  density  A  =  10  g/cm.  (The  distance  between  the  centers 
of  balls  A  and  B  is  thus  30  cm,  and  between  the  centers  of  balls  B  and  C  is  40  cm).  The  thickness  of  the 
rods  is  negligible.  We  use  the  xyz  frame  indicated  in  the  figure,  with  a  z  axis  coming  out  of  the  paper  in 
the  origin. 
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(a)  Find  the  position  of  the  center  of  mass  of  the  object. 

(b)  Find  the  moment  of  inertia  of  the  object  with  respect  to  rotations  about  the  y-axis. 

(c)  We  apply  a  torque  of  10  Nm  to  set  the  object  spinning  about  the  y-axis.  Calculate  the  magnitude  of 
the  resulting  angular  velocity. 

(d)  Find  the  moment  of  inertia  of  the  object  with  respect  to  rotations  about  an  axis  through  its  center  of 
mass,  and  parallel  to  the  y-axis. 

(e)  Find  the  moment  of  inertia  of  the  object  with  respect  to  rotations  about  an  axis  through  its  center  of 
mass,  and  parallel  to  the  z-axis. 

5.10  Three  uniform  bars  with  linear  density  (i.e.,  mass  per  unit  length)  A  are  welded  together  into  the  shape  of 

an  isosceles  triangle  with  sides  of  length  5  L,  5  L,  and  6  L  (figure  a). 


The  moment  of  inertia  of  a  rod  with  mass  m  and  length  L  about  an  axis  that  goes  through  one  of  its  end 
points  at  an  angle  6  with  the  rod  itself  (figure  b)  is  given  by 


(a)  Find  the  position  of  the  center  of  mass  of  the  triangle. 

(b)  Find  the  moment  of  inertia  of  the  triangle  about  its  axis  of  symmetry.  To  do  so,  you  may  use  the  given 
expression  for  the  moment  of  inertia  of  a  bar  at  an  angle. 

(c)  Find  the  moment  of  inertia  of  the  triangle  about  its  longest  side. 

(d)  Find  the  moment  of  inertia  of  the  triangle  about  the  axis  through  its  center  of  mass,  perpendicular  to 
the  triangle’s  plane.  Use  the  theorems  proved  in  this  section  to  do  so. 
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(e)  Are  rotations  of  this  triangle  about  its  symmetry  axis  stable? 

(f)  [Challenging]  Derive  the  given  expression  for  the  moment  of  inertia  of  the  rod  under  an  angle. 

5.11  We  consider  the  same  setting  as  in  the  worked  example  in  section  5.8.  A  massive  cylinder  with  mass  m 
and  radius  R  rolls  without  slipping  down  a  plane  inclined  at  an  angle  6  (see  figure  a).  The  coefficient  of 
(static)  friction  between  the  cylinder  and  the  plane  is  p. 


(a)  Find  the  largest  angle  6  for  which  the  cylinder  doesn’t  slip. 

(b)  Suppose  now  that  we  accelerate  the  plane  upwards  (along  its  direction)  with  acceleration  a,  see  fig¬ 
ure  b.  For  what  value  of  a  does  the  center  of  mass  of  the  cylinder  not  move?  (You  may  assume  that 
the  cylinder  still  isn’t  slipping). 

5.12  We  consider  a  wheel  with  mass  m  and  radius  R  which  rolls  without  slipping  on  a  flat  (problems  a  and  b) 
and  an  inclined  (problems  c-i)  plane. 


(a)  Draw  a  free-body  diagram  of  the  wheel  rolling  on  a  horizontal  surface  (figure  a).  Also  indicate  the 
(linear)  velocity  vector  v  of  the  wheel. 

(b)  How  much  work  is  done  by  the  friction  force  on  the  wheel?  What  effect  does  this  work  have  on  the 
linear  and  angular  velocity  of  the  wheel? 

We  now  consider  the  case  that  the  wheel  rolls  up  an  inclined  plane  (angle  a,  see  figure  b)  with  initial 

velocity  vq. 

(c)  Again  draw  a  free-body  diagram  of  the  wheel  and  indicate  the  linear  velocity  v. 

(d)  Determine  the  equation  of  motion  for  both  the  linear  and  the  rotational  velocity  of  the  wheel  (i.e. 
apply  Newton’s  second  law  to  the  translational  and  rotational  motion  ot  the  wheel) .  For  this  problem, 
you  can  describe  the  wheel  as  a  cylindrical  ring  (and  thus  ignore  the  mass  of  the  spokes). 

(e)  Solve  the  equations  of  motion  from  (12d)  to  obtain  the  linear  acceleration  of  the  wheel. 
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(f)  From  the  equations  of  motion,  determine  the  magnitude  and  the  direction  of  the  frictional  force 
acting  on  the  wheel. 

(g)  Determine  the  total  energy  of  the  wheel. 

(h)  Take  the  derivative  of  the  total  energy  of  problem  ( 12g)  to  again  arrive  at  an  equation  of  motion  for 
the  wheel. 

(i)  Is  the  energy  of  the  wheel  rolling  up  the  incline  conserved?  Does  the  frictional  force  do  any  work  in 
this  case? 

5.13  We  consider  a  solid  cone  of  uniform  density  p,  height  h  and  base  radius  R. 


a)  A 


h 


R 


Figure  5.6:  Sketch  of  the  cone,  (a)  Dimensions,  (b)  Tilt  angle  6. 

(a)  Taking  the  origin  to  be  at  the  center  of  the  base,  and  the  symmetry  axis  along  the  z  (vertical)  axis, 
show  that  the  center  of  mass  of  the  cone  is  located  at  (0, 0,  III  4).  Hint:  consider  the  cone  as  a  stack  of 
disks  and  pick  your  origin  carefully. 

(b)  Over  what  maximum  angle  can  you  tilt  the  cone  such  that  it  just  doesn’t  fall  over? 

(c)  Qualitatively  sketch  the  potential  energy  landscape  U{9)  of  the  cone  as  a  function  of  this  tilt  angle  9. 
Indicate  the  stationary  points  and  their  stability.  Assume  the  cone  is  narrow,  i.e.,  the  radius  of  the 
base  is  less  than  a  quarter  of  the  height. 

(d)  Going  back  to  the  upright  position,  show  that  the  moment  of  inertia  of  the  cone  around  its  axis  of 
symmetry  is  given  by  I  —  ( 3I\0)MR 2,  where  M  is  the  total  mass  of  the  cone. 

(e)  Consider  a  cone  which  is  100  cm  high,  has  a  base  radius  of  20  cm,  and  a  mass  density  of  700  kg/m3. 
If  the  cone  is  standing  with  its  base  on  a  perfectly  slippery  surface  (no  friction) ,  and  you  want  to  set  it 
spinning  at  1  revolution  per  minute,  starting  from  rest,  how  much  work  do  you  have  to  do? 

5.14  A  hockey  stick  of  mass  ms,  length  L,  and  moment  of  inertia  (with  respect  to  its  center  of  mass)  Iq  is  at 
rest  on  the  ice  (which  we  assume  to  be  frictionless) .  A  puck  with  mass  mp  hits  the  stick  a  distance  D  from 
the  middle  (i.e.,  center  of  mass)  of  the  stick.  Before  the  collision,  the  puck  was  moving  with  speed  uq  in  a 
direction  perpendicular  to  the  stick,  as  indicated  in  the  figure.  The  collision  is  completely  inelastic,  and 
the  puck  remains  attached  to  the  stick  after  the  collision. 


a) 


b) 


center  of  mass 
stick 


center  of  mass 
stick 


D 


center  of  mass 
stick  +  puck 
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(a)  Find  the  speed  Vf  of  the  center  of  mass  of  the  stick  +  puck  combination  after  the  collision,  in  terms  of 
Vo,  m.p,  ms,  and  L. 

(b)  After  the  collision,  the  stick  and  puck  will  rotate  about  their  combined  center  of  mass,  which  lies  a 
distance  b  from  the  center  of  mass  of  the  stick.  Find  b. 

(c)  What  is  the  angular  momentum  Lcm  of  the  system  before  the  collision,  with  respect  to  the  center  of 
mass  of  the  final  system?  Use  your  answer  at  (b)  to  eliminate  the  value  of  b  from  your  answer. 

(d)  What  is  the  angular  velocity  <i>  of  the  stick  +  puck  combination  after  the  collision?  Your  answer  should 
be  independent  of  the  variable  b. 

(e)  Are  the  kinetic  energy,  linear  momentum,  and  angular  momentum  conserved  in  this  collision? 

5.15  We  consider  the  same  situation  as  in  problem  .  ,  but  now  the  collision  is  fully  elastic.  Find  the  velocity 

of  the  ball  after  the  collision,  as  well  as  the  translational  and  rotational  velocity  of  the  stick  after  the 
collision.  Hint:  You  have  to  solve  for  three  unknowns,  so  you  need  three  equation  -  what  are  they? 

5.16  In  Greek  mythology  Sisyphus  by  life  was  the  king  of  Ephyra  (Corinth).  After  he  died,  he  was  punished 
for  his  self- aggrandizing  craftiness  and  deceitfulness  in  Tartarus  (Greek  hell)  by  being  forced  to  roll  an 
immense  boulder  up  a  hill  only  for  it  to  roll  down  when  it  nears  the  top,  repeating  this  action  for  all 
eternity.  We  take  the  original  rock  to  have  a  diameter  of  1.00  m  and  a  mass  of  2000  kg;  the  hill  is  1.00  km 
high  and  has  a  slope  of  30°. 

(a)  Suppose  Sisyphus  somehow  managed  to  replace  this  rock  with  one  made  of  aerogel  with  a  mass  of 
only  1.0  kg  covered  by  a  thin  layer  of  rock  which  also  has  a  mass  of  1.0  kg.  To  hide  his  deceit,  he  still 
walks  uphill  slowly,  and  kicks  the  rock  when  it  ‘slips’  down  again.  What  speed  must  Sisyphus  give  the 
fake  rock  such  that  it  has  the  same  speed  at  the  bottom  as  the  real  rock  would  have?  You  may  assume 
that  both  rocks  roll  all  the  way  down  without  slipping. 

(b)  Hades,  being  not  amused  with  Sisyphus’  trickery  in  part  (a),  decides  to  repay  him  using  his  own  mea¬ 
sures.  He  sends  Sisyphus  off  to  Helheim  (Norse  hell),  strapped  to  the  original  rock,  which  he  kicks 
so  hard  that  it  reaches  escape  velocity  (11.2  km/s)  while  also  spinning  at  a  rate  of  1.0  revolutions  per 
second.  How  much  energy  did  Hades  put  into  the  rock  (you  may  ignore  Sisyphus’  mass,  he’s  a  ghost 
anyway)? 

(c)  Hel,  the  ruler  of  Helheim,  doesn’t  want  him  either,  so  she  picks  up  the  capstone  of  a  dolmen,  with 
a  mass  ten  times  that  of  Sisyphus’  rock,  and  hurls  it  towards  the  oncoming  Sisyphus.  As  it  happens, 
Hel  throws  just  hard  enough  that  after  the  collision,  Sisyphus  (now  stuck  between  the  two  rocks)  is 
spinning  in  place.  How  fast  did  Hel  throw  her  stone? 

(d)  At  what  angular  velocities  do  Sisyphus  and  his  two  rocks  spin  after  the  collision? 

(e)  Dante  finally  finds  a  place  for  Sisyphus  in  Inferno  (Christian  hell).  To  get  him  there,  he  first  needs  to 
remove  the  capstone.  Taking  pity  on  Sisyphus,  Dante  wishes  to  do  so  in  a  way  that  the  original  rock 
stops  spinning.  He  stands  on  Sisyphus’  rock,  pushing  on  the  capstone.  What  torque  must  he  exert  on 
that  stone  to  stop  Sisyphus’  rock? 

(f)  Dante  then  ties  a  100  km  long  rope  to  the  rock,  which  at  the  other  end  is  connected  to  Lucifer  (down 
at  the  center  of  Inferno).  Lucifer  then  hauls  in  the  rope,  with  a  speed  of  1.0  m/s.  Unfortunately 
for  Sisyphus,  the  removal  of  the  capstone,  although  it  has  stopped  him  from  spinning,  has  resulted 
in  him  getting  a  small  linear  velocity  of  1.0  cm/s  in  the  direction  perpendicular  to  the  rope.  When 
Lucifer  has  hauled  in  enough  rope  to  put  Sisyphus  in  the  fourth  circle  of  Inferno  at  about  5  km  from 
himself,  what  is  the  angular  velocity  of  Sisyphus’  rock? 

5.17  Two  children,  with  masses  mi  -  10  kg  and  m2  —  10  kg  sit  on  a  simple  merry-go-round  that  consists  of 
a  massive  disk  with  a  mass  of  100  kg  and  a  radius  of  2.0  m.  The  disk  can  rotate  freely  about  its  center, 
and  is  doing  so  at  a  frequency  cjq  of  5.0  revolutions  per  minute.  A  third  child  with  mass  m3  =  10  kg  runs 
towards  the  merry-go-round  with  a  speed  vq  of  1.0  m/s,  at  an  angle  of  30°  with  the  tangent  to  the  merry- 
go-round’s  edge  (see  figure).  Once  it  reaches  the  merry-go-round,  the  child  jumps  on  it,  and  continues 
rotating  with  the  other  two.  Find  the  resulting  rotational  velocity  of  the  merry-go-round  with  the  three 
children. 
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General  planar  motion 


Although  Newton’s  laws  of  motion,  the  various  force  laws,  and  the  three  conservation  laws  we  have  derived, 
are  all  valid  in  three  dimensions,  we  have  so  far  restricted  our  study  of  motion  almost  exclusively  to  two 
special  cases:  linear  motion  in  one  dimension,  and  rotational  motion  in  a  plane,  where  the  radius  of  the 
rotation  is  constant.  Although  for  the  second  case  we  do  need  two  directions  to  describe  it,  the  motion  itself 
is  constricted  to  a  circle,  and  thus  essentially  one-dimensional.  In  this  section,  we’ll  look  at  general  motion  in 
a  plane  -  which  turns  out  to  capture  a  large  number  of  important  nontrivial  cases. 


6.1.  Projectile  motion 

The  simplest  case  of  two-dimensional  motion  occurs  when  a  particle  experiences  a  force  only  in  one  direc¬ 
tion.  The  prime  example  of  this  case  is  the  motion  of  a  projectile  in  Earth’s  (or  any  other  planet’s)  gravitational 
field  as  locally  described  by  Galilean  gravity  (equation  2.8):  F  -  mg.  Once  a  projectile  has  been  fired  with  a 
certain  initial  velocity  vq,  we  can  find  its  trajectory  by  solving  the  equation  of  motion  that  follows  from  New¬ 
ton’s  second  law:  mg  —  mr.  We  can  decompose  r  and  vq  in  horizontal  (x)  and  vertical  (z)  components; 
each  of  them  has  its  own  one-dimensional  equation  of  motion,  which  we  already  solved  in  section  2.3.  The 
horizontal  component  experiences  no  force  and  thus  executes  a  simple  linear  motion  with  uniform  velocity 
vq  cos  0o,  where  9q  -  arccos(i/o-x)/fo  is  the  angle  with  the  horizontal  under  which  the  projectile  was  fired  and 
Vo  -  |  i/o  I  the  initial  speed.  Likewise,  because  the  acceleration  due  to  gravitation  is  constant,  our  projectile  will 
execute  a  uniformly  accelerated  motion  in  the  vertical  direction  with  initial  velocity  vq  sin0o.  If  the  projectile’s 
initial  position  is  [xq,  zq),  its  motion  is  thus  described  by: 


r(f)  = 


x(t) 

z(f) 


X0  )  [  COS0O 

z0  J  V°{  sin  0O 


(6.1) 


6.2.  General  planar  motion  in  polar  coordinates 


Although  in  principle  all  planar  motion  can  be  described  in  Cartesian  coordinates,  they  are  not  always  the 
easiest  choice.  Take,  for  example,  a  central  force  field  (a  force  field  whose  magnitude  only  depends  on  the 
distance  to  the  origin,  and  points  in  the  radial  direction),  as  we’ll  study  in  the  next  section.  For  such  a  force 
field  polar  coordinates  are  a  more  natural  choice  than  Cartesians.  However,  polar  coordinates  do  carry  a  few 
subtleties  not  present  in  the  Cartesian  system,  because  the  direction  of  the  axes  depends  on  position.  We  will 
therefore  first  derive  the  relevant  expressions  for  the  position,  velocity  and  acceleration  vector,  as  well  as  the 
components  of  the  force  vector,  in  polar  coordinates  for  the  general  case. 

As  we  already  know  (see  appendix  A.2) ,  the  position  vector  7  —  xx+yy  has  a  particularly  simple  expression 
in  polar  coordinates:  r  =  rr ,  where  r  =  \J  x2  +  y2 .  To  find  the  velocity  and  acceleration  vectors  in  polar  coor¬ 
dinates,  we  take  time  derivatives  of  r.  Note  that  because  the  orientation  of  the  polar  basis  vectors  depends 
on  the  position  in  space,  the  time  derivative  acts  on  both  the  distance  to  the  origin  r  and  the  basis  vector  r . 
Because  the  two  polar  basis  vectors  are  each  other’s  derivatives  with  respect  to  0  (see  equation  A.8),  we  find 
for  their  time  derivatives: 


dr  _  dr  d0  _  ^  dO  _  d 0  d0 
d t  d0  df  ’  df  d0  df 


(6.2) 
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For  the  velocity  and  acceleration  vectors  we  then  find: 

dr 

v=  —  -  rr  +  rdO,  (6.3) 

df 

dv  -o  ••  -a 

a=  —  =  {r-rez)r  +  (rQ  +  2rd)0.  (6.4) 

dr 

Note  that  equations  (5.3)  and  (5.6)  are  the  special  cases  of  equations  (6.3)  and  [6/  )  for  which  both  the  radius  r 
and  the  angular  velocity  a>  —  9  are  constant. 

Using  equation  (  )  for  r  in  Newton’s  second  law,  we  get  an  expression  decomposing  the  net  force  F  into 

a  radial  and  an  angular  part,  each  of  which  consists  of  two  terms: 

F=mr  =  Frr  +  FgO  (6.5) 

Fr  =  m[r  -  r62)  (6.6) 

Fg  =  m{r6  +  2r0)  (6.7) 

The  two  terms  in  Fr  are  readily  identified  as  the  radial  acceleration  r  (acting  along  the  line  through  the  ori¬ 
gin)  and  the  centripetal  force  (which  causes  objects  to  rotate  around  the  origin,  see  equation  (  )).  The 

first  term  rd  in  Fg  is  the  tangential  acceleration  a  of  a  rotating  object  whose  angular  velocity  is  changing 
(equation  (  8)).  The  last  term  in  Fg  we  have  not  encountered  before;  it  is  known  as  the  Coriolis  force 

FCor  =  2  mrdO,  (6.8) 

and  is  associated  with  a  velocity  in  both  the  radial  and  the  angular  direction.  It  is  fairly  weak  on  everyday 
length  scales,  but  gets  large  on  global  length  scales.  In  particular,  if  you  move  over  the  surface  of  the  Earth 
(necessarily  with  a  nonzero  angular  component  of  your  velocity),  it  tends  to  deflect  you  from  a  straight  path. 
On  the  Northern  hemisphere,  if  you  move  horizontally,  it  tends  to  push  you  to  the  right;  it  also  pushes  you 
west  when  going  up,  and  east  when  going  down.  Coriolis  forces  are  responsible  for  the  rotational  movement 
of  air  around  high  and  low  pressure  zones,  causing  respectively  clockwise  and  counterclockwise  currents 
around  them  on  the  Northern  hemisphere  (see  figure  ).  We’ll  encounter  the  Coriolis  force  again  in  the 
more  general  three-dimensional  setting  in  section 


Figure  6.1:  The  Coriolis  force  causes  clockwise  and  counterclockwise  currents  around  high  and  low  pressure  zones  on  the  Northern 
hemisphere,  (a)  Pressure  gradient  (blue),  Coriolis  force  (red)  and  resulting  air  flow  (black)  around  a  low  pressure  zone,  (b)  Typical 
satellite  picture  of  a  low-pressure  zone  and  associated  winds  over  Iceland.  Picture  by  NASA’s  Aqua/MODIS  satellite  [20]. 


6.3.  Motion  under  the  action  of  a  central  force 

A  central  force  is  a  force  that  points  along  the  (positive  or  negative)  radial  direction  r ,  and  whose  magnitude 
depends  only  on  the  distance  r  to  the  origin  -  so  F(r)  =  F(r)f.  Central  forces  can  be  defined  in  both  two 
and  three  dimensions,  with  the  three-dimensional  concept  of  the  radial  distance  (to  the  origin)  and  direc¬ 
tion  (direction  of  increasing  r)  completely  analogous  to  the  two-dimensional  case.  Two  important  examples 
of  central  forces  are  (general)  Newtonian  gravity  (2.9)  and  the  Coulomb  force  (2.1  )  between  two  charged 
objects.  Although  these  forces  are  three-dimensional  examples,  discussing  them  here  is  appropriate,  as  the 
following  theorem  shows. 


6.3.  Motion  under  the  action  of  a  central  force 
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Theorem  6.1.  The  motion  of  a  particle  under  the  action  of  a  central  force  takes  place  in  a  plane. 

Proof  We  first  note  that  a  central  force  can  exert  no  torque  on  an  object:  t  =  r  x  F  =  E(r)(r  x  r)  =  0.  Con¬ 
sequently,  under  the  action  of  a  central  force,  angular  momentum  is  conserved.  Moreover,  we  have  r  ■  L- 
r  •  (r  x  p)  =  0  and  v  ■  L  =  v  ■  (r  x  mv)  =  0.  Both  the  position  vector  r  and  the  velocity  vector  v  thus  lie  in  the 
plane  perpendicular  to  L.  As  L  is  conserved  r  and  v  must  be  confined  to  the  plane  perpendicular  to  L  and 
through  the  origin.  tj 


Applying  the  results  of  the  previous  section  to  the  motion  of  a  single  particle  under  the  action  of  a  central 
force,  we  find  (for  the  plane  in  which  the  particle  moves): 


F[r)  —  Fr  —  mr-  mrd2  -  mr  - 


L2 

mr 3  ’ 


(6.9) 


where  we  used  that  for  a  single  particle,  the  magnitude  of  the  angular  momentum  is  given  by  L  -  mr2B. 
Rewriting  equation  (  )  gives 

L2 


mr  =  F{r)  + - ?  =  F(r)  +  Fc{,  (6.10) 

mri 

where  Fc  f  is  known  as  the  centrifugal  force,  as  it  tends  to  move  our  particle  away  from  the  origin.  We  can  write 
the  centrifugal  force  as  the  derivative  of  a  potential: 


dt/rf  _  d  l  L2  \ 
dr  dr  \2mr2 ) 


(6.11) 


Writing  the  original  force  as  the  derivative  of  a  potential  U{r)  as  well,  we  can  write  down  an  equation  for  the 
total  energy  of  the  system: 


E=K+U=  -mr  +  U(r)  + - T. 

2  2  mr2 


(6.12) 


For  both  Newtonian  gravity  and  the  Coulomb  force,  the  potential  can  be  written  as  U{r)  =  -air,  where 
a  =  Gni\m2  for  gravity  and  a  -  - ke q \  q2  for  Coulomb’s  law.  We  can  then  rewrite  the  energy  equation  as  a 
differential  equation  for  r(t): 


1  (dr 
-  m  — 

2  { dt 


L2 

2  mr2 


(6.13) 


To  describe  the  motion  of  the  particle,  rather  than  specifying  r(f)  and  0(f),  we  would  like  to  express  r  as  a 
function  of  0.  We  can  rewrite  equation  (  )  to  a  differential  equation  for  r(0)  by  invoking  the  chain  rule: 


(dr)2  (dr  d0)2_(dr)2(  L  )2 

[dfj  \d0  dfj  1  d0 J  [mr2J 

where  we  again  used  that  L  =  mr20.  Equation  (  13)  now  becomes: 

I  1  drj2  1  2  ma  2  mE 

l^doj  ""T2  +  “l27  +  _l^' 


(6.14) 


(6.15) 


Equation  (6.1  )  isn’t  pretty,  but  can  be  simplified  a  little  by  absorbing  the  1/r2  on  the  left  hand  side  into  the 
derivative,  and  completing  the  square  on  the  right  hand  side: 


d0 


2  EL2\ 
ma2 ) 


(6.16) 


We  can  simplify  equation  (  )  further  by  introducing  a  new  variable,  z  =  j  -  ma! I? .  We  also  introduce  a 

dimensionless  constant  e  =  Vl  +  2EL2 /met2  and  an  inverse  length  q  —  mad  if.  With  these  substitutions,  our 
equation  becomes: 


2  2 
-z  +q  . 


(6.17) 


We  can  solve  equation  (  )  by  separation  of  variables: 


I 


vV  -  z2 


:dz=  d0 


/’ 


• arccos | — 


e-e0. 


(6.18) 
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Taking  the  reference  angle  9q  (our  integration  constant)  to  be  zero,  we  find  z(0)  =  qcos(O).  Translating  back 
to  r(6),  we  obtain  a  fairly  simple  solution: 


r(0)  = 


L 2  1 

ma  1  +  ecosO 


(6.19) 


What  the  solution  (6.19)  (the  orbit  of  our  particle  under  the  action  of  the  central  force)  actually  looks  like, 
depends  on  the  value  of  our  dimensionless  variable  e,  known  as  the  eccentricity  of  the  orbit.  To  find  out 
which  orbits  we  can  get,  we  translate  equation  (  )  back  to  Cartesian  coordinates,  using  x  —  rcosO.  Defining 
k  =  L2lma,  we  get  k-r  +  cr  cos 6  —  r  +  ex,  or  r  —  k-  ex.  Now  using  r2  =  x2  +  y2,  we  get 


x2  +  y2  -  (k-  ex)2  =  k2  -  2ekx+  e2x2. 


(6.20) 


We  can  now  distinguish  four  possibilities: 

1.  e  =  0:  In  this  case,  equation  (  )  becomes  x2  +  y2  —  k2,  so  our  orbit  is  a  circle  with  the  origin  at  its 

center. 

2.  0  <  e  <  1:  For  this  case,  with  some  algebra,  we  can  rewrite  equation  (  )  as  ((x- Xq)I a)2  +  ( ylb )2  =  1, 

where  a  =  kl  (\  -  e2),  xo  =  -ea,  and  b  =  kl  Vl-e2.  These  orbits  are  ellipses,  with  the  center  of  the  ellipse 
at  (xo,0),  semi-major  axis  a,  semi-minor  axis  b,  and  focal  length  /  =  V a2  -b2  =  kel(  1  -  e2)  =  -xq.  One 
of  the  foci  thus  lies  at  the  origin. 

3.  e  =  1:  Equation  (6.20)  now  becomes  y2  =  k2  -  2k x,  which  is  the  equation  for  a  parabola  (extending 
along  the  negative  x-axis)  with  its  ‘top’  (in  this  case,  rightmost  point)  at  (kl 2, 0)  and  focal  length  /c/2,  so 
the  (single)  focus  is  again  located  at  the  origin. 

4.  e  >  1:  This  case  again  requires  some  algebra  to  rewrite  equation  (  20)  in  a  recognizable  standard  form: 
((x-  xo )la)2  -  (ylb)2  =  1,  where  a  =  kl (e2  -  1),  xo  -  ea  and  b  =  klVe2  -  1.  These  orbits  are  hyperbola, 
crossing  the  x-axis  at  (xo,0),  and  approaching  asymptotes  y  =  ±b((xla )  -  e),  which  meet  at  (xo  +  a, 0). 
The  focal  length  is  now  /  =  V a2  +  b2  —  kel  (e2  -  1)  =  ea  —  xq  +  a,  so  the  focus  of  the  hyperbola  is  also 
located  at  the  origin. 

In  mathematics,  these  four  possible  types  of  orbits  are  known  as  conic  sections :  the  curves  you  can  get  by  in¬ 
tersecting  a  cone  with  a  plane.  Specifically,  when  the  central  force  is  gravity,  such  as  in  the  solar  system  (where 
the  sun  is  so  much  heavier  than  everything  else  combined  that  to  good  approximation  we  can  describe  orbits 
as  being  determined  by  the  sun’s  gravitational  field  alone),  the  four  cases  listed  above  are  the  only  possible 
orbits  bodies  can  have.  The  planets,  dwarf  planets,  asteroids,  and  many  minor  objects  in  our  solar  system  all 
follow  elliptical  orbits  around  the  sun,  albeit  with  different  eccentricity  -  Earth’s  is  almost  zero  (0.017),  but 
that  of  Mars  is  significantly  higher  (0.09),  and  of  Pluto  much  higher  still  (0.25).  Comets,  on  the  other  hand, 
typically  parabolic  or  hyperbolic  orbits.  Spacecraft  such  as  the  Voyager  and  New  Elorizons  probes  are  often 
put  on  trajectories  past  planets  that  are  not  their  final  destination,  to  pick  up  (or  loose)  speed  through  a  grav¬ 
itational  assist  (in  which  they  take  a  little  bit  of  momentum  from  the  planet’s  orbit);  those  paths  past  planets 
are  typically  hyperbola.  Getting  a  spacecraft  to  orbit  another  planet  (i.e.,  in  a  bound,  so  elliptical)  orbit  is 
actually  much  harder,  but  again,  the  resulting  orbit  is  described  by  the  maths  presented  above. 


6.4.  Kepler’s  laws 

The  fact  that  the  planets  move  in  elliptical  orbits  was  first  discovered  by  Kepler,  based  on  observational  data 
alone  (he  didn’t  have  the  benefit,  as  we  do,  of  living  after  Newton  and  thus  knowing  about  Newton’s  law  of 
gravity) .  Kepler  summarized  his  observational  facts  in  three  laws,  which  we  can,  with  the  benefit  of  hindsight, 
prove  to  be  corollaries  of  Newton’s  laws. 

Theorem  6.2  (Kepler’s  first  law).  The  planets  move  in  elliptical  orbits,  with  the  sun  at  one  of  the  foci. 

Proof  This  is  case  two  of  the  general  result  given  by  equations  (  )  and  (  ).  □ 


1  See  table  for  data  on  the  orbits  of  all  planets  and  a  number  of  their  moons. 
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Johannes  Kepler  (1571-1630)  was  a  German  astronomer  and  mathemati¬ 
cian  who  made  major  contributions  to  understanding  the  motion  of  the 
planets.  Copernicus  had  published  his  heliocentric  (rather  than  geocen¬ 
tric)  view  of  the  universe  posthumously  in  1543,  but  the  two  systems  were 
still  heavily  debated  in  Kepler’s  time.  Having  been  convinced  that  Coper¬ 
nicus  was  right,  Kepler  set  out  to  construct  a  geometric  description  of  the 
solar  system.  He  initially  tried  to  do  so  using  polyhedra  and  Platonic  solids, 
but  found  that  these  could  not  accurately  describe  the  data.  In  1600,  Ke¬ 
pler  met  with  astronomer  Tycho  Brahe,  who  had  made  meticulous  obser¬ 
vations  of  the  known  planets,  and,  having  been  convinced  of  Kepler’s  skills 
in  mathematics,  shared  his  data  with  him.  After  Tycho’s  death  in  1601,  Ke¬ 
pler  succeeded  him  as  imperial  mathematician  in  Prague,  where  he  devel¬ 
oped  his  laws  over  the  next  decade.  Unfortunately,  Kepler’s  Calvinist  views 
got  him  in  trouble  frequently  with  both  the  Catholic  and  the  Lutheran 

church,  which  led  to  his  excummunication,  but  he  managed  to  avoid  fur-  Figlfe  6  2:  Portrait  of  J°hannes 

Kepler  (1610)  [2]  ]. 

ther  persecution  by  moving  frequently,  and  he  always  could  continue  his 
scientific  work.  The  Kepler  spacecraft  and  mission,  launched  in  2009  to 
hunt  for  extrasolar  terrestial  planets,  is  named  in  his  honor. 


Theorem  6.3  (Kepler’s  second  law) .  A  line  segment  joining  a  planet  and  the  sun  sweeps  out  equal  areas  during 
equal  intervals  of  time. 

Proof.  This  law  is  nothing  but  a  special  case  of  conservation  of  angular  momentum.  Consider  a  small  piece 
of  the  orbit,  in  which  the  planet  moves  a  distance  dx.  The  lines  connecting  the  initial  and  final  points  of  this 
piece  of  orbit  with  the  sun  make  an  angle  d£L  If  the  initial  distance  from  the  planet  to  the  sun  was  r,  and  the 
final  distance  r  +  dr,  we  have,  to  first  order,  dx  -  rd9.  The  infinitesimal  area  the  planet  has  swiped  out  is  then 
given  by  (area  of  a  triangle):  dA  =  \r  dx  =  j  r2  d().  If  we  want  to  know  how  much  area  was  swept  out  over  an 
amount  of  time,  we  need  to  know  the  time  derivative  of  A,  which  is  thus  given  by  d Al  d t  -  j  r2  d6 1  df.  Now 
using  that  the  angular  momentum  of  the  planet  is  given  by  L-  mr2d,  we  find 

d A  r2  d 0  L 

—  = - = - ,  (6.21) 

df  2  df  2m 


which  is  constant  if  L  is  conserved. 


□ 


Theorem  6.4  (Kepler’s  third  law).  The  square  of  the  period  T  of  an  orbit  is  proportional  to  the  cube  of  its  semi- 
major-axis  length  a: 


T2 


GMe 


(6.22) 


where  M0  is  the  mass  of  the  sun. 


Proof.  We  integrate  equation  (  )  over  the  period  of  a  whole  orbit,  which  gives  A-  LT  12m.  By  Kepler’s  first 

law,  the  orbit  is  an  ellipse,  so  its  area  equals  A  -  nab,  with  a  and  b  the  ellipse’s  semi-major  and  semi-minor 
axes.  The  two  axes  are  related  by  b  —  a\J  1  -  e2,  with  e  again  the  eccentricity  of  the  ellipse.  Making  these 
substitutions  and  squaring  the  resulting  relation,  we  get: 


n2  a 4  ■ 


m{  1  -  e2)  4 m 


(6.23) 


Using  k  -  L2 1  mot,  like  in  equation  (  ),  and  the  observation  that  for  an  elliptical  orbit  kl[l-  e2)  -  a,  we  get 

L2  lm{\  -  e2)  -  aa.  Now  for  orbits  in  the  solar  system,  a  =  GM0m,  so  we  arrive  at  equation  (  ).  □ 
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6.5.  Problems 

6.1  A  particularly  useful  orbit  for  satellites  is  the  geosynchronous  one:  the  orbit  in  which  the  satellite  rotates 
around  the  Earth  in  exactly  one  day,  so  with  respect  to  the  ground,  it  is  always  in  the  same  position.  Find 
the  altitude  (i.e.,  distance  above  the  Earth’s  surface)  for  a  circular  geosynchronous  orbit. 

6.2  Kepler’s  laws  apply  to  the  case  that  an  object  with  relatively  small  mass  m  orbits  an  object  with  large 
mass  M,  which  we  assume  stays  fixed.  Technically,  this  is  incorrect:  both  objects  rotate  about  their  com¬ 
mon  center  of  mass.  Fortunately,  we  can  still  use  the  expressions  derived  in  this  section,  with  a  small 
modification.  To  see  how  this  works,  we  write  down  the  equations  of  motion  for  the  two  objects,  due  to 
the  force  they  exert  on  each  other: 


*i  =  -— F(r),  x2  =  -J-F(r),  (6.24) 

m  M 

where  Jti  is  the  position  of  the  object  with  mass  m,  x2  that  of  the  object  with  mass  M,  and  r  -  x2~x\  their 
separation.  We  denote  the  position  of  the  center  of  mass  of  the  system  by  R. 

(a)  As  there  is  no  external  force  acting  on  the  system,  the  total  momentum  is  conserved  and  therefore 
the  center  of  mass  cannot  accelerate.  Argue  that  this  implies  that 

{m  +  M)R  =  0,  (6.25) 

and  combine  equations  (6.24)  and  (6.25)  into  an  expression  for  R  in  terms  of  xt ,  x2,  and  the  masses 
of  the  two  objects. 

(b)  From  equation  (  ) ,  also  derive  an  equation  of  motion  for  the  separation  r  between  the  two  objects. 

The  equations  you  found  in  (a)  and  (b)  together  are  equivalent  to  the  equations  of  motion  in  (6.24),  but 
only  one  is  a  differential  equation,  and  they  are  uncoupled:  we  don’t  need  to  know  the  position  of  the 
center  of  mass  to  find  the  separation,  and  vice  versa. 

(c)  Show  that  you  can  re-write  the  equation  of  motion  for  the  separation  between  the  two  objects  as 
F(r)  =  [ir,  where  p  is  the  reduced  mass  that  we  also  encountered  when  studying  collisions  in  the 
center  of  mass  frame,  equation  ( 1.36),  given  by 

mM 

p  = - . 

m  +  M 


Note  that  solving  the  final  equation  for  the  separation  r  is  entirely  equivalent  to  solving  the  equation  of 
motion  of  a  single  particle  under  the  action  of  a  central  force,  with  the  modification  that  the  mass  of  the 
particle  is  replaced  by  the  reduced  mass.  For  the  case  that  m«M,  the  reduced  mass  is  approximately 
equal  to  m. 


(d)  Calculate  the  reduced  mass  of  the  Earth-Moon  two  body  problem.  Can  we  state  that  the  Moon  re¬ 
volves  around  the  Earth? 


(e) 


Nowhere  in  the  derivations  in  this  problem  did  we  assume  that  m  «  M.  The  same  rules  apply  to 
any  two  objects.  Consider  the  opposite  limit:  two  objects  (these  might  for  instance  be  binary  stars)  of 
equal  mass  M  that  rotate  around  their  common  center  of  mass.  Show  that  for  this  case,  for  circular 
orbits  the  orbital  period  is  given  by 


2  2  n2  d 3 

T2= - , 

GM 


(6.26) 


where  d  is  the  distance  between  the  two  objects. 


6.3  A  student  with  mass  65.0  kg  stands  at  the  center  of  a  simple  merry-go-round  that  consists  of  a  large  disk  of 
radius  1.5  m  and  mass  25  kg  and  is  making  a  full  rotation  every  2.0  s.  The  student  walks  out  to  a  distance 
of  0.50  m  from  the  center. 


(a)  Find  the  rotational  frequency  of  the  merry-go-round  with  the  student  at  this  point. 

(b)  What  are  the  forces  acting  on  the  student  at  this  point? 


General  rotational  motion* 


7.1.  Linear  and  angular  velocity 

We  related  the  linear  and  angular  velocities  of  a  rotating  object  in  two  dimensions  in  section  .  There,  we 
also  already  stated  the  relation  between  the  linear  velocity  vector  and  rotation  vector  in  three  dimensions 
(equation  (  )): 

v-toxr.  (7.1) 

It  is  not  hard  to  see  that  this  expression  indeed  simplifies  to  the  scalar  relationship  u  -  tor  for  rotations  in  a 
plane,  with  the  right  sign  for  the  linear  velocity.  That’s  hardly  a  proof  though,  so  let’s  put  this  on  some  more 
solid  footing.  Suppose  r  makes  an  angle  tf>  with  to.  Suppose  also  that  it  changes  by  dr  in  a  time  interval 
df,  then  if  we  have  pure  rotation,  dr  is  perpendicular  to  both  r  and  to,  and  its  magnitude  is  given  by  |  dr|  = 
tor  sin</>df  =  \to  x  r|  At,  where  to  and  r  are  the  lengths  of  their  respective  vectors.  Finally,  as  seen  from  the  top 
(i.e.,  looking  down  the  vector  to),  the  rotation  should  be  counter-clockwise  (by  definition  of  the  direction  of 
to),  which  corresponds  with  the  direction  of  to  x  r.  We  thus  find  that  both  the  magnitude  and  direction  of 
v  =  dr/ df  indeed  equal  to  x  r,  and  equation  (  )  holds. 


7.2.  Rotating  reference  frames 

In  section  ,  we  considered  what  happens  if  we  considered  the  (linear)  motion  of  an  object  from  a  stationary 
(Tab  frame’)  or  co-moving  point  of  view,  with  special  attention  for  the  center  of  mass  frame.  These  frames 
were  moving  with  constant  velocity  with  respect  to  each  other,  and  were  all  inertial  frames  -  Newton’s  first  and 
second  laws  hold  in  all  inertial  frames.  In  this  section,  we’ll  consider  a  rotating  reference  frame,  where  instead 
of  co-moving  with  a  linear  velocity,  we  co-rotate  with  a  constant  angular  velocity.  Rotating  reference  frames 
are  not  inertial  frames,  as  to  keep  something  rotating  (and  thus  change  the  direction  of  the  linear  velocity) 
requires  the  application  of  a  net  force.  Instead,  as  we’ll  see,  in  a  rotating  frame  of  reference  we’ll  get  all  sorts 
of  fictitious  forces  -  forces  that  have  no  real  physical  source,  like  gravity  or  electrostatics,  but  originate  from 
the  fact  that  we’re  in  a  rotating  reference  frame. 

In  a  rotating  reference  frame,  the  direction  of  the  basis  vectors  changes  over  time  (as  measured  with  re¬ 
spect  to  the  stationary  lab  frame  -  this  is  different  from  the  linearly  co-moving  frames,  where  the  directions  of 
the  basis  vectors  remained  constant).  To  see  how  the  basis  vectors  change  over  time,  we  can  simply  calculate 
their  linear  velocity,  using  equation  (5.5): 


dx  ^  dv  A  Az 

—  -  to  x  x,  —  -  tox  v,  —  -  tox  z. 

df  df  df 


We  can  now  easily  determine  the  change  of  an  arbitrary  vector  u  —  uxx  +  uyy+  uzz  over  time: 


Au 

df 


du^ 

dt 

Su 


-X+  ■ 


Ally 

dt 


■y+ 


duz 

dt 


z  +  \u 


dx 

df 


+  u. 


dy 

df 


+  u. 


Az 

df 


=  ——  +  to  x  u, 
St 


(7.2) 
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where  is  defined  by  equation  (  ),  and  represents  the  time  derivative  of  u  in  the  rotating  basis  .  We  see 

that  in  addition  to  the  ‘regular’  time  derivative,  acting  on  the  components  of  u,  we  get  an  additional  term 
u>  x  u  due  to  the  rotation  of  the  system.  Note  that  for  u-u>,  we  find  that  d 0)1  df  =  Scot 8 t  -  u>,  i.e.,  the  time 
derivative  of  the  rotation  vector  is  the  same  in  the  stationary  and  rotating  frames. 

The  prime  example  of  a  vector  is  of  course  the  position  vector  r  of  a  particle,  the  second  derivative  of 
which  appears  in  Newton’s  second  law  of  motion.  We’ll  calculate  that  second  derivative  for  a  position  vector 
in  a  rotating  coordinate  frame.  The  first  derivative  is  a  simple  application  of  equation  (7.3): 


dr 

df 


8r 

—  +  a>  x  r. 
St 


(7.4) 


To  get  the  second  derivative,  we  apply  (  )  to  the  velocity  vector  found  in  (  ) : 

d2r  d  ( Sr 

dr2  =  di\8i  +0)Xr 

8  [Sr  )  (Sr 

=  — - twxr+tux - Kuxr 

8t\8t  )  V  <5  r 

S2r  Soj  Sr 

=  — -  h - x  r  +  2 u>  x - 1  a)  x  (a)  x  r).  (7.5) 

St2  St  St 


Like  in  the  two-dimensional  case  given  by  equation  (6.4),  we  find  that  the  acceleration  in  a  rotating  reference 
frame  picks  up  extra  terms  compared  to  a  stationary  (or  more  general,  inertial)  frame.  To  get  a  complete 
picture,  we  also  allow  the  origin  of  the  rotating  frame  to  be  different  from  that  of  the  stationary  lab  frame.  Let 
Hab  be  the  position  vector  in  the  lab  frame,  R  the  vector  pointing  from  the  origin  of  the  lab  frame  to  that  of  the 
rotating  frame,  and  r  the  position  vector  in  the  rotating  frame.  We  then  have  r\a b  -  R  +  r,  and  for  the  second 
derivative  of  qab  we  find: 


d2nab  d2r  d  2R 

d  t2  d  t2  d  t2 

S2r  Sr  So)  d  2R 

=  _+„*to*r)+2„*_+_,<r  +  _  (7,6) 

We  can  substitute  equation  (  )  in  Newton’s  second  law  of  motion  in  the  lab  frame  (i.e.,  just  F  -  driab/  df)  to 

find  the  expression  for  that  law  in  the  rotating  frame: 


S2r 

m— 75-  =  F-m 
St2 


(1)  x  (o>  x  r)  +  2u>  x  v  +  (i)  x  r  + 


d2i? 

df2 


(7.7) 


where  we  defined  v  -  Sr  IS  t  as  the  velocity  in  the  rotating  frame,  and  used  that  the  time  derivative  of  (1)  is  the 
same  in  both  the  stationary  and  the  rotating  frame.  We  find  that  we  get  four  correction  terms  to  the  force  due 
to  our  transition  to  a  rotating  frame.  They  are  not  ‘real’  forces  like  gravity  or  friction,  as  they  vanish  in  the  lab 
frame,  but  you  can  easily  experience  their  effects,  when  you’re  in  a  turning  car  or  rotating  carousel.  As  they 
have  no  physical  origin,  we  call  these  forces  fictitious.  They  are  known  as  the  centrifugal,  Coriolis,  azimuthal, 
and  translational  force,  respectively: 


Fcf  = 

-  mat  x  (to  x  r) 

(7.8) 

£ 

O 

II 

-2 mw  x  v 

(7.9) 

Faz  — 

-m(i)  x  r 

(7.10) 

d  2R 

^trans  — 

-m—^r 

At2 

(7.11) 

We  encountered  the  centrifugal,  Coriolis  and  azimuthal  force  before  in  section  6.2.  To  see  how  the  ex¬ 
pressions  above  connect  to  the  planar  versions,  let  us  pick  the  coordinates  of  the  rotating  frame  such  that  the 
direction  of  u>  coincides  with  the  z-axis.  The  rotational  motion  and  the  forces  can  then  be  described  in  terms 
of  the  cylindrical  coordinates  consisting  of  the  polar  coordinates  (p,  0)  in  the  xy- plane  and  z  along  the  z-axis 
(note  that  we  use  p  =  \J x2  +  y2  for  the  radial  distance  in  the  xy  plane  instead  of  r,  as  r  is  now  the  distance 


1  Some  authors  use  the  notation  ( ^7 )  t  f°r  ■ 
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Figure  7.1:  Rotation  of  a  dumbbell,  consisting  of  two  equal  masses  m  separated  by  a  distance  d  =  2r  about  an  axis  which  is  not  a 
symmetry  axis  of  the  system. 


from  the  origin  to  our  point  in  three  dimensions).  For  the  centrifugal  force  we  have  u>  ■  Fc f  =  0,  so  it  lies  in  our 
newly  defined  xy  plane,  and  in  cylindrical  coordinates  it  can  be  expressed  as 

Fc f  =  —m  [&>(&>-  r)  -  w2r]  =  ma>2{xx  +  yy)  —  mco2pp.  (7.12) 

The  centrifugal  force  is  thus  nothing  but  minus  the  centripetal  force,  which  we  already  encountered  for  uni¬ 
form  rotational  motion  in  equation  (  ),  and  as  the  second  term  in  (  ).  The  centrifugal  force  is  the  force 

you  ‘feel’  pushing  you  sidewards  when  your  car  makes  a  sharp  turn,  and  is  also  responsible  for  creating  the 
parabolic  shape  of  the  water  surface  in  a  spinning  bucket,  see  problem  7.4. 

The  Coriolis  force  is  present  whenever  a  particle  is  moving  with  respect  to  the  rotating  coordinates,  and 
tends  to  deflect  particles  from  a  straight  line  (which  you’d  get  in  an  inertial  reference  frame) .  We  have  tu-Fcor  = 
v  ■  Fcor  =  0,  so  the  Coriolis  force  is  perpendicular  to  both  the  rotation  and  velocity  vectors  -  note  that  this  is 
the  velocity  in  the  rotating  frame.  In  the  two-dimensional  case,  we  had  Fcor  =  2 mpwO  (equation  6.8),  which 
for  a  velocity  in  the  radial  direction,  p,  gives  a  force  in  the  angular  direction  0. 

The  azimuthal  force  occurs  when  the  rotation  vector  of  our  rotating  system  changes  -  i.e.,  when  the  rota¬ 
tion  speeds  up  or  decelerates,  or  the  plane  of  rotation  alters.  In  either  case  we  have  r  •  Fc0r  -  0,  so  the  force  is 
perpendicular  to  the  position  vector.  If  it  is  the  magnitude  of  the  rotation  vector  that  changes,  and  we  again 
take  the  rotation  vector  to  lie  along  the  z-axis  in  the  rotating  frame,  di  also  lies  along  the  z-axis,  and  we  get 

Faz- -mtiizx  r  —  -mrOO,  (7.13) 

so  the  azimuthal  force  is  minus  m  times  the  tangential  acceleration  a  (see  equation  5.8),  or  minus  the  first 
term  of  (6.7). 

The  translational  force  finally  occurs  when  the  rotating  reference  frame’s  origin  accelerates  with  respect 
to  that  of  the  stationary  lab  frame.  You  also  feel  it  if  the  ‘rotating’  reference  frame  is  actually  not  rotating,  but 
only  accelerating  linearly  -  it’s  the  force  that  pushes  you  back  in  your  seat  when  your  car  or  train  accelerates. 

7.3.  Rotations  about  an  arbitrary  axis 
7.3.1.  Moment  of  inertia  tensor 

In  chapter  5,  we  studied  the  rotation  of  rigid  bodies  about  an  axis  of  symmetry.  For  these  cases,  we  have 
L  =  Io),  where  I  is  the  moment  of  inertia  with  respect  to  the  rotation  axis.  We  already  noted  that  I  depends  on 
which  axis  we  pick,  and  that  the  proportional  relation  between  the  rotation  vector  and  angular  momentum 
is  not  the  most  general  possibility.  In  this  section,  we’ll  derive  the  more  general  form,  in  which  the  number  I 
is  replaced  by  a  2-tensor,  i.e.,  a  map  from  a  vector  space  (here  R3)  into  itself,  represented  by  a  3  x  3  matrix. 
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To  arrive  at  the  more  general  relation  between  L  and  ft),  we  go  back  to  the  original  definition  oiL  -  r  x  p, 
and  consider  the  motion  of  a  dumbbell  around  an  axis  which  is  not  a  symmetry  axis  (see  figure  ).  If  the 
dumbbell  makes  an  angle  9  with  the  rotation  axis,  and  rotates  counter-clockwise  as  seen  from  the  top,  we  get: 

L-  mr  x  v+  m{—r)  x  (- v ) 

=  2 mr  x  (to  x  r) 

-2  m[(x){r  ■  r)-  r{r  ■  ft))] 

=  2mr2a) -2mcor  cosdr  (7.14) 


where  we  used  equation  (  )  relating  the  linear  velocity  v  to  the  rotational  velocity  ft)  through  v  —  u>  x  r. 

Equation  (  )  shows  that  for  a  rotation  about  an  arbitrary  axis  through  the  center  of  the  dumbbell,  we  get 

two  terms  in  L.  The  first  term,  2 mr2u>,  is  the  rotation  about  an  axis  perpendicular  to  the  dumbbell,  and 
equals  let)  for  I  —  2 mr2,  as  we  found  in  section  .  The  second  term,  -2ma>rcosQr,  tells  us  that  in  general 
we  also  get  a  component  of  L  along  the  axis  pointing  from  the  rotation  center  to  the  rotating  mass  (i.e.,  the 
arm).  Note  that  the  two  terms  cancel  when  9  =  n/2,  as  wed  expect  for  in  that  case  the  moment  of  inertia  of 
the  dumbbell  is  zero. 

We  can  easily  generalize  equation  (7.14)  to  any  set  of  masses  ma  with  position  vectors  ra  (where  the  index 
a  runs  over  all  particles),  and  with  a  rotation  ft)  about  an  arbitrary  axis: 

L  =  YJmaraX(Mxra)  =  I-(D.  (7.15) 

a 

The  moment  of  inertia  tensor  is  defined  by  equation  (7.15) .  It  is  a  symmetric  tensor,  mapping  a  vector  ft)  in  R3 
onto  another  vector  L  in  R3.  In  Cartesian  coordinates,  we  can  express  its  nine  components  as  three  moments 
of  inertia  about  the  x,  y  and  z  axes,  which  will  be  the  diagonal  terms  of  /: 

hx  =  Y.m^yl  +  =  +  hz  =  £  tna{x2a  +  y2a),  (7.16) 

a  a  a 

and  three  products  of  inertia  for  the  off-diagonal  components: 

~  lyx  ~  ~  y.  wiaxaya  Ixz  —  Izx  —  —  y.  maxaza  I v z  ~  Izv  =  ~~  y"!  wiayaza.  (7.17) 

a  a  a 

We  can  also  write  equations  (  )  and  (  )  more  succinctly  using  index  notation,  where  i  and  j  run  over  x, 

y  and  z,  and  we  use  the  Kronecker  delta  8  ij  which  is  one  if  i  =  j  and  zero  if  if  j: 

hj  =  'ZJrnAr28ij  -  rtr j).  (7.18) 

a 


Equations  (7.16)  and  (7.17)  generalize  to  continuous  objects  in  the  same  way  equation  (5.13)  generalized 
to  (  ) .  Using  the  index  notation  again,  we  can  explicitly  write: 


I 


ij  ~ 


nrj)p{r)  dV. 


(7.19) 


The  moment  of  inertia  tensor  contains  all  information  about  the  rotational  inertia  of  an  object  (or  a  col¬ 
lection  of  particles)  about  any  axis.  In  particular,  if  one  of  the  axes  (say  the  z-axis)  is  an  axis  of  symmetry,  we 
get  that  Ixz  =  Iyz  —  0,  and  for  rotations  about  that  axis  (so  u>  -  toz),  we  retrieve  L  =  Iza>. 

In  addition  to  calculating  the  angular  momentum,  we  can  also  use  the  moment  of  inertia  tensor  for  cal¬ 
culating  the  kinetic  energy  for  rotations  about  an  arbitrary  axis.  We  have: 

1  V- 

K-  ~  2^  ma  va  ■  va 

2  a 

1  v- 

=  -  2^  tna((x)  x  ra)  ■  (ft)  x  ra) 

^  a 

Y,mafa  x  (ft)  x  ra) 

a 

1 

=  -ft)  •  L  =  -ft)  •  ft). 

2  2 


(7.20) 
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Leonhard  Euler  (1707-1783)  was  a  Swiss  mathematician  who  made  major 
contributions  to  many  different  branches  of  mathematics,  and,  by  appli¬ 
cation,  physics.  He  also  introduced  much  of  the  modern  mathematical 
terminology  and  notation,  including  the  concept  of  (mathematical)  func¬ 
tions.  Euler  was  possibly  the  most  prolific  mathematician  who  ever  lived, 
and  likely  is  the  person  with  the  most  equations  and  formula’s  named  after 
him.  Although  his  father,  who  was  a  pastor,  encouraged  Euler  to  follow  in 
his  footsteps,  Euler’s  tutor,  famous  mathematician  (and  family  friend)  Jo¬ 
hann  Bernoulli  convinced  both  father  and  son  that  Euler’s  talent  for  math¬ 
ematics  would  make  him  a  giant  in  the  field.  Famous  examples  of  Euler’s 
work  include  his  contributions  to  graph  theory  (the  Koningsberg  bridges 
problem),  the  relation  e 171  +  1  =  0  between  five  fundamental  mathematical 
numbers  named  after  him,  his  work  on  power  series,  a  method  for  numer¬ 
ically  solving  differential  equations,  and  his  work  on  fluid  mechanics  (in 
which  there  is  also  an  ‘Euler’s  equation’). 


Figure  7.2:  Portrait  of  Leonhard 
Euler  by  Jakob  Handmann  (1753) 
[22]. 


7.3.2.  ER’S  EQUATIONS 

In  the  lab  frame,  we  have  equation  (  .2:  )  relating  the  torque  and  the  angular  momentum.  We  used  this  equa¬ 
tion  to  prove  conservation  of  angular  momentum  in  the  absence  of  a  net  external  torque,  and  to  study  pre¬ 
cession.  However,  for  rotations  about  an  arbitrary  axis,  it  is  easier  to  transform  to  a  frame  in  which  we  rotate 
with  the  object,  much  like  moving  with  the  center  of  mass  makes  the  study  of  collisions  much  easier.  We’ve 
already  done  the  math  for  transforming  to  a  co-rotating  frame  in  section  .2;  here  we  only  need  the  result  in 
equation  (7.3)  to  find  the  time  derivative  of  the  angular  momentum  in  the  rotating  frame.  Equation  (5.25) 
then  translates  to: 

SL 

T  =  — —  +  (x)  x  L  - /•  (i)  +  (x)  X  [I-O)).  (7.21) 

St 


Now  since  /  is  symmetric,  all  its  eigenvalues  are  real,  and  its  eigenvectors  are  a  basis  for  IR3;  moreover,  for 
distinct  eigenvalues  the  eigenvectors  are  orthogonal,  so  from  the  eigenvector  basis  we  can  easily  construct 
an  orthonormal  basis  (tq ,  e-2,  ef)  of  eigenvectors  corresponding  to  the  three  eigenvalues  I\,  I2  and  I3.  If  we 
express  the  moment  of  inertia  tensor  in  this  orthonormal  eigenvector  basis,  its  representation  becomes  a 
simple  diagonal  matrix,  /  =  diagUi,  I2, I3).  We  call  the  directions  e,-  the  principal  axes  of  our  rotating  object, 
and  the  associated  eigenvalues  the  principal  moments  of  inertia.  The  construction  of  the  principal  axes  and 
moments  of  inertia  works  for  any  object  -  including  ones  that  do  not  exhibit  any  kind  of  symmetry.  If  an  object 
does  have  a  symmetry  axis,  that  axis  is  usually  also  a  principal  axis,  as  can  easily  be  checked  by  calculating 
the  products  of  inertia  with  respect  to  that  axis  (they  vanish  for  a  principal  axis) . 

If  we  express  our  rotational  quantities  in  the  principal  axis  basis  {uni  tvecet }  of  our  rotating  object,  our 
equations  become  much  simpler.  We  have 
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or  in  components:  L,-  =  Jj-aq.  Equation  (7.21)  simplifies  to: 
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,  I3LD3  J 

(7.23) 


which  gives  for  the  three  components  of  the  torque: 


f  1  =  h<i>\  +  H3  -  l2)(L>2,(t>2i 
+2  =  h(i>2  +  Ul  ~  73l<+>i<+>3, 
+3  =  73^3  +  U2  - 


(7.24) 


Equations  (  .2  )  are  known  as  Euler’s  equations  (of  a  rotating  object  -  the  classification  is  necessary  as  there 
are  many  equations  associated  with  Euler). 
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As  an  example,  let’s  apply  Euler’s  equations  to  our  dumbbell.  We  take  the  origin  at  the  pivot,  i.e.,  where  the 
rotation  axis  crosses  the  dumbbell’s  own  axis.  The  dumbbell  does  have  rotational  symmetry,  about  the  axis 
connecting  the  two  masses  -  let’s  call  that  the  3-axis.  The  other  two  axes  then  span  the  plane  perpendicular 
to  the  dumbbell;  we  can  pick  any  orthonormal  pair  for  the  1  and  2-axes.  The  rotation  vector  in  this  basis  is 
given  by 


f to 1  \ 

f  °  1 

(X)  = 

0>2  =  0) 

sind 

,  0)3  ) 

k  COS0  ) 

The  products  of  inertia  vanish;  the  principal  moments  are  given  by  h-h  -  \mdz  (with  d  the  distance 
between  the  two  masses)  and  I3  =  0.  As  long  as  the  rotational  velocity  is  constant  {do  -  0),  we  get  from  Euler’s 
equations  that  iq  =  T3  =  0,  and  iq  =  -  |  md2oo2  sin 0 cost).  We  can  thus  rotate  our  dumbbell  about  an  axis 
that’s  not  a  symmetry  axis,  but  at  a  price:  it  exerts  a  torque  on  its  support,  which  in  turn  exerts  a  counter¬ 
torque  to  keep  the  dumbbell’s  rotation  axis  in  place.  This  torque  will  change  the  angular  momentum  of  our 
dumbbell  over  time.  If  we  remove  the  force  exerting  the  counter-torque  (e.g.,  if  our  dumbbell  is  supported  at 
the  pivot,  remove  the  support) ,  the  dumbbell  will  turn,  in  our  example  about  axis  2,  until  the  rotation  vector  w 
has  become  parallel  with  the  angular  momentum  vector  L. 

For  the  dumbbell,  as  it  has  a  rotational  symmetry,  two  of  the  principal  moments  are  identical.  There 
are  many  objects  that  have  no  such  symmetry,  but  they  still  have  three  well-defined  principal  axes.  While 
for  the  dumbbell  rotation  about  any  of  the  principal  axes  is  stable,  this  is  not  the  case  for  an  object  with 
three  different  principal  moments.  A  good  example  is  a  tennis  racket,  whose  principal  axes  are  sketched  in 
figure  ’.3.  The  accompanying  theorem  about  the  stability  of  rotations  about  these  axes  is  easily  demonstrated 
with  a  tennis  racket,  and  bears  its  name. 

Theorem  7.1  (Tennis  racket  theorem).  If  the  three  principal  moments  of  inertia  of  an  object  are  different  (say 
/ 1  <  i‘2  <  I3),  then  rotations  about  the  principal  axes  1  and  3  associated  with  the  maximum  and  minimum 
moments  I\  and  /->,  are  stable,  but  those  about  the  principal  axis  2  associated  with  the  intermediate  moment  I2 
are  unstable. 


Proof.  For  rotations  about  a  principal  axis,  the  torque  is  zero  (by  construction),  so  Euler’s  equations  read 


do  1  + 
do  2  + 
do  3  + 


h  ~  h 
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h-h 
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h-h 
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0. 


(7.26) 


If  we  rotate  about  axis  1,  then  (1)2  and  (1)3  are  (at  least  initially)  very  small,  so  the  first  line  in  (  )  gives  toj  =  0. 

We  can  then  derive  an  equation  for  0)2  by  taking  the  time  derivative  of  the  second  line  of  (  2f )  and  using  the 
third  line  for  0)3,  which  gives: 


0-d>2  + 


h-h 
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[d)\(i)3  +  0)  10)3)  —002- 


h-hh-h 
h  h 


oo\oo2. 


(7.27) 


Now  {I\-If]l  h  <  0,  (h~h)lh  >  0,  and  to2  >  0,  and  o>2  satisfies  the  differential  equation  do2  =  -coo  2,  with  c  >  0. 
Solutions  to  this  equation  are  of  course  sines  and  cosines  with  constant  amplitude.  Although  002  can  thus  be 
finite,  its  amplitude  does  not  grow  over  time  (and  will  in  fact  decrease  due  to  drag),  so  rotations  about  axis 
2  are  opposed.  Similarly,  we  find  that  rotations  about  axis  3  cannot  grow  in  amplitude  either,  and  rotations 
about  axis  1  are  stable.  We  can  repeat  the  same  argument  for  axes  2  and  3.  For  axis  3,  we  find  that  rotations 
about  the  other  two  axes  are  likewise  opposed,  so  rotations  about  this  axis  are  stable  as  well.  For  axis  2  on  the 
other  hand,  we  find  that 

„  h~  h  h~h  2  „  om 

0  =  coi - ooijoo  1,  (7.28) 

h  h 

or  do  1  =  coo  1,  with  c  another  positive  constant.  Solutions  to  this  equation  are  not  sines  and  cosines,  but  ex¬ 
ponential:  o)\(t)  —  Aexpi^/ct)  +  Bexpl-y/ct),  which  means  that  for  any  finite  initial  rotation  about  the  1  axis, 
the  amplitude  of  this  rotation  will  grow  over  time,  and  rotations  about  the  2-axis  are  thus  unstable.  □ 
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Figure  7.3:  The  three  principal  axes  of  a  tennis  racket. 

7.4.  Problems 

7.1  Two  Delft  students  wish  to  re-create  Galilei’s  experiment  dropping  objects  with  different  mass  from  a  high 
tower.  They  use  the  tower  of  the  Old  Church  in  Delft,  which,  like  the  more  famous  one  in  Pisa  supposedly 
used  by  Galilei,  leans  over  somewhat.  The  tip  of  the  tower  is  75  m  above  street  level,  and  2.0  nr  removed 
from  the  vertical.  The  student  who  will  drop  the  objects  stands  on  the  trans  at  60  nr.  The  base  of  the  tower 
is  a  square  of  10  x  10  m. 

(a)  How  far  from  the  base  of  the  tower  do  you  expect  the  stone  to  fall? 

(b)  The  second  student,  who  has  done  the  same  calculation  you  did  in  (a),  has  put  a  camera  close  to  the 
floor  aimed  at  the  spot  where  the  objects  will  drop.  Surprisingly,  in  a  test  drop  of  a  single  stone,  he 
observes  that  the  actual  position  the  stone  hits  the  ground  deviates  from  this  spot.  The  student  at  the 
top  however  insists  that  she  dropped  the  stone  straight  down  from  the  tower  trans,  as  agreed.  The 
students  therefore  go  back  to  their  Mechanics  books  and  realize  that  they  forgot  to  account  for  the 
rotation  of  the  Earth.  Which  of  the  (fictional)  forces  described  in  this  section  could  cause  the  stone 
to  deviate  from  its  straight  path  down? 

(c)  Delft  is  located  on  the  Northern  hemisphere.  In  which  direction  will  the  trajectory  of  the  stone  be 
deflected? 

(d)  Delft  is  at  52.0°N.  What  is  the  magnitude  of  the  deflection  of  the  dropped  stone  on  the  ground?  You 
may  neglect  air  resistance  in  this  calculation. 

7.2  Foucault’s  pendulum  A  well-known  (and  conclusive)  proof  of  the  fact  that  the  Earth  is  rotating  is  pro¬ 
vided  by  a  Foucault  pendulum,  first  presented  by  French  physicist  Leon  Foucault  in  1851  (a  replica  of  his 
device  is  on  permanent  exhibit  in  the  Pantheon  in  Paris,  as  well  as  in  many  other  science  musea  around 
the  world,  see  figure  ) .  A  key  part  of  this  pendulum  is  the  way  its  pivot  is  constructed:  it  has  to  be  rota- 
tionally  symmetric  and  frictionless,  so  it  can’t  exert  any  torques  on  the  pendulum  itself.  Consequently,  the 
plane  in  which  the  pendulum  oscillates  will  remain  unchanged  ,  even  as  the  Earth  rotates.  Therefore,  for 
observers  on  Earth,  the  plane  of  the  pendulum  seems  to  rotate  over  time.  To  see  how  this  works,  consider 
putting  this  pendulum  at  the  North  pole.  Then  for  an  external  observer,  the  plane  of  the  pendulum  stays 

2  With  respect  to  the  (relatively)  fixed  frame  provided  by  a  collection  of  distant  stars. 
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Figure  7.4:  Foucault's  pendulum,  (a)  Coordinates  in  Paris,  (b)  For  a  pendulum  with  a  long  string  and  small  amplitude,  the  velocity  of  the 
bob  will  be  almost  horizontal,  (c)  The  replica  of  Foucault’s  original  pendulum  at  the  Pantheon  [  ] . 


fixed  (there  are  no  forces  acting  on  it),  while  the  Earth  (looking  down  on  the  North  pole)  rotates  counter¬ 
clockwise;  for  an  Earth-bound  observer,  the  pendulum’s  plane  thus  seems  to  move  clockwise  (again  as 
seen  from  the  top),  making  one  full  revolution  in  one  day. 

Paris  is  not  on  the  North  pole,  but  it  does  lie  on  the  Northern  hemisphere,  so  the  pendulum  will  still 
appear  to  rotate  clockwise,  just  at  a  slower  frequency.  We’ll  calculate  this  precession  frequency  in  this 
problem. 

(a)  First,  we  need  the  angular  velocity  in  Paris,  in  a  useful  coordinate  system.  Define  the  z  axis  as  pointing 
upwards  in  Paris,  and  x  as  the  tangent  to  the  planet  due  North  (see  figure  a).  Express  w  in  these 
coordinates. 

(b)  If  the  pendulum  has  a  very  long  string  (the  original  Foucault  one  is  67  m)  compared  to  its  amplitude, 
the  velocity  v  of  the  weight  will  be  roughly  in  the  horizontal  direction,  see  figure  b.  Argue  why,  in 
this  case,  the  component  of  a>  in  the  x  direction  will  not  change  the  frequency  at  which  the  plane  of 
the  pendulum  precesses. 

(c)  The  pendulum’s  plane  rotates  with  a  (precession)  frequency  Wp  =  Wp z  with  respect  to  the  Earth’s 
frame  fixed  in  Paris.  This  precession  frequency  must  exactly  compensate  for  the  Earth’s  rotation  in 
the  frame  of  the  pendulum  (as  in  that  frame,  there  are  no  forces  acting  on  the  pendulum,  and  thus  its 
plane  of  oscillation  stays  fixed).  Show  that  these  considerations  imply  that  cjp  =  -aicosff. 

(d)  Suppose  that  you  enter  the  Pantheon  at  noon,  and  mark  the  direction  in  which  the  pendulum  is 
oscillating.  When  you  return  an  hour  later,  by  how  much  will  this  plane  have  rotated?  Will  this  be 
enough  to  be  visible  by  eye?  The  Pantheon  is  at  48°50'46"N  (note  that  degrees,  like  hours,  are  divided 
in  (arc)minutes  and  seconds  that  run  up  to  60,  not  100). 

7.3  An  alternative  way  to  show  the  effect  of  the  rotation  of  the  Earth  involves  only  a  smooth  horizontal  plane 
and  a  particle  that  can  slide  over  it.  Show  that  if  the  particle’s  velocity  is  v,  its  trajectory  will  be  a  circle 
with  radius  r  =  vl2Q,  where  Q  is  the  Earth’s  rotational  velocity. 

7.4  The  centrifugal  force  emerges  in  a  rotating  coordinate  frame,  and  famously  causes  the  parabolic  shape  of 

the  surface  of  water  in  a  rotating  bucket.  As  the  centrifugal  force  is  always  perpendicular  to  the  rotation 
axis,  we  can  pick  coordinates  such  that  the  rotation  axis  coincides  with  the  z-axis,  u>  =  wz,  and  we  can 
express  the  centrifugal  force  in  cylindrical  coordinates  as  Fcp  =  mw2pp  (equation  ). 

We  now  consider  a  small  volume  of  water  at  the  rotating  surface  (in  steady-state).  There  are  two  forces 
acting  on  this  mass  of  water:  gravity  (pointing  down,  as  always)  and  the  centrifugal  force,  pointing  out¬ 
ward,  see  figure  .  The  resulting  net  force  cannot  have  a  component  along  the  surface,  as  this  would 
result  in  an  acceleration  of  the  water  (and  hence  a  water  flow);  therefore,  the  force  must  be  perpendicular 
to  the  surface,  and  counterbalance  the  pressure  in  the  water  (just  like  it  would  for  the  flat  surface  of  water 
in  a  bucket  that  is  not  rotating) . 
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Figure  7.5:  Parabolic  water  surface  in  a  rotating  bucket.  The  axis  of  rotation  coincides  with  the  axis  of  symmetry.  For  a  small  volume  of 
water  at  the  surface,  the  gravitational  force  and  the  centrifugal  force  add  up  to  an  effective  gravitational  force,  which  must  be  perpendic¬ 
ular  to  the  water  surface  in  steady-state.  This  net  force  is  counterbalanced  by  the  pressure  in  the  water  ( Fr ). 


The  gravitational  and  centrifugal  force  add  up  to  what  is  known  as  an  effective  gravity,  given  by 

geff=-gz  +  <»2PP •  (7.29) 

(a)  Find  the  angle  9  the  direction  of  the  effective  gravitational  force  makes  with  the  vertical  (see  fig¬ 
ure  7.5). 

(b)  If  the  gravitational  force  is  to  be  perpendicular  to  the  water  surface,  we  must  have 

dz 

—  =  tand. 
dp 

Integrate  this  equation  to  find  z(p)  (and  thus  the  shape  of  the  surface). 

(c)  Find  the  potential  energy  corresponding  to  the  effective  gravitational  force,  Fe g-  =  mge ff. 

(d)  Argue  why  the  potential  energy  must  be  constant  on  the  water  surface,  and  from  this  condition,  again 
derive  the  shape  of  the  water  surface. 


Oscillations 


8.1.  Oscillatory  motion 

8.1.1.  Tarmonic  oscillator 

We’ve  already  encountered  two  examples  of  oscillatory  motion  -  the  rotational  motion  of  chapter  5,  and  the 
mass-on-a-spring  system  in  section  2.3  (see  figure  ).  The  latter  is  the  quintessential  oscillator  of  physics, 
known  as  the  harmonic  oscillator.  Recapping  briefly,  we  get  its  equation  of  motion  by  considering  a  mass  m 
that  is  being  pulled  on  by  a  massless  ideal  spring  of  spring  constant  k.  Equating  the  resulting  spring  force 
(Hooke’s  law)  to  the  net  force  in  Newton’s  second  law  of  motion,  we  get: 

mx  =  -kx.  (8.1) 

The  harmonic  oscillator  is  characterized  by  its  natural  frequency  o)q: 


as  follows  readily  by  dimensional  arguments  (or,  of  course,  by  solving  the  differential  equation).  Because 
equation  (8.1 )  is  second-order,  its  solution  has  two  unknowns;  moreover,  as  it  has  to  be  minus  its  own  deriva¬ 
tive  we  readily  see  that  it  should  be  a  linear  combination  of  sines  and  cosines  (for  a  formal  derivation,  see 
appendix  t.3.2 ) .  We  can  write  the  solution  in  two  different  ways: 

v(0) 

x{t)  -  x(0)  cos(wo  t)  h - stn(woi),  (8.3) 

Wo 

=  A  cos  (o»ot +  </>),  (8.4) 

where  the  phase  tp  is  given  by  tan  <p  -  and  the  amplitude  A  by  A  =  Unsurprisingly,  as  they 

are  both  simple  periodic  motions,  there  is  a  direct  relationship  between  a  harmonic  oscillator  with  natural 
frequency  oj(],  and  a  point  on  a  disk  rotating  with  uniform  angular  velocity  aig  in  the  xy-plane  -  the  motion  of 
the  harmonic  oscillator  is  that  of  the  disk  projected  on  the  x  (or  y)  axis. 

8.1.2.  IRSIONAL  OSCILLATOR 

A  torsional  oscillator  is  the  rotational  analog  of  a  harmonic  oscillator  -  imagine  a  disk  with  moment  of  inertia 
I  suspended  by  a  massless,  frictionless  rope  that  has  a  torsional  constant  k,  i.e.,  the  force  to  twist  the  rope  is 
given  by  F  —  -k8,  with  9  the  twist  angle.  By  invoking  the  rotational  analog  of  Newton’s  second  law  of  motion, 
equation  (  ) ,  we  readily  find  for  the  equation  of  motion  of  the  torsional  oscillator: 


16  =  -kB,  (8.5) 

so  the  torsional  oscillator  indeed  is  the  exact  rotational  analog  of  the  harmonic  oscillator,  and  has  a  natural 
frequency  of  Wq  -  VkI I. 
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Christiaan  Huygens  (1629-1695)  was  a  Dutch  physicist  and  astronomer, 
and  one  of  the  major  figures  in  the  scientific  revolution.  Huygens  invented 
the  pendulum  clock  in  1656,  which  revolutionized  timekeeping  and  re¬ 
mained  the  most  accurate  clock  for  300  years.  Huygens  was  also  the  first 
to  cast  the  laws  of  physics  in  mathematical  form,  writing  down  an  early 
(quadratic)  version  of  Newton’s  second  law  of  motion,  the  equation  for 
the  centripetal  force  (eq.  .10),  and  the  correct  form  of  the  laws  of  elas¬ 
tic  collisions  (section  ).  Observing  two  pendulum  clocks  on  the  same 
wall,  Huygens  observed  that  they  synchronized  (see  section  8.' ).  Huy¬ 
gens’  study  of  optics  led  him  to  formulate  the  wave  theory  of  light,  which 
can  correctly  predict  light  diffraction.  In  astronomy,  he  discovered  the  first 
feature  on  the  surface  of  Mars,  the  largest  moon  of  Saturn  (Titan),  and  that 
the  previously  observed  ‘shape  changes’  of  Saturn  were  due  to  the  pres¬ 
ence  of  its  rings.  The  Huygens  probe  that  landed  on  Titan  in  2005  was  very 
appropriately  named  in  his  honor. 


Figure  8.1:  1671  portrait  of  Huy¬ 
gens  by  Caspar  Netscher  [23]. 


8.1.3.  Pendulum 

A  pendulum  is  an  object  that  is  suspended  on  a  horizontal  peg  through  any  point  xp  but  its  center  of  mass  xcm 
(it  won’t  oscillate  if  you  pin  it  at  the  center  of  mass).  If  the  center  of  mass  of  the  pendulum  is  pulled  sideways, 
gravity  will  exert  a  torque  around  the  position  of  the  peg,  pulling  the  pendulum  back  down.  If  the  distance  be¬ 
tween  xp  and  xcm  is  L,  and  the  line  connecting  them  makes  an  angle  6  with  the  vertical  through  xp,  then  the 
torque  exerted  by  gravity  around  xp  equals  -mgL  sind,  where  as  usual  m  is  the  mass  of  the  pendulum.  Now 
again  invoking  equation  5.1  ,  we  can  write  for  the  equation  of  motion  of  the  pendulum  (with  I  its  moment  of 
inertia  about  xp): 

Id  —  -  mgL  sin  6.  (8.6) 

Unfortunately  we  can’t  solve  equation  (8i).  For  small  angles  however,  we  can  Taylor-expand  the  sine,  and 
write  sinO  ~  9,  which  takes  us  back  to  the  harmonic  oscillator  equation.  From  that  we  find  that  for  this 
pendulum  (called  the  physical  pendulum),  the  natural  frequency  is  cjq  =  mgLII.  For  the  special  case  that 
the  pendulum  consists  of  a  mass  m  suspended  on  a  massless  rope  of  length  L  (the  simple  pendulum ),  we 
have  I  =  mL2  and  thus  (Oo  -  \J g!L. 

8.1.4.  Oscillations  in  a  potential  energy  landscape 

The  potential  energy  associated  with  a  mass  on  a  spring  has  a  very  simple  form:  Us(x)  =  ^kx2  (see  equa¬ 
tion  3.15).  The  potential  energy  landscape  of  a  harmonic  oscillator  thus  has  the  shape  of  a  parabola.  Now 
that’s  a  shape  that  we  encounter  very  often:  the  shape  of  pretty  much  every  landscape  about  a  minimum 
closely  resembles  a  parabola  .  To  see  why  this  is  the  case,  simply  Taylor-expand  the  potential  energy  about  a 
minimum  at  xo:  because  the  function  has  a  minimum  at  xo,  U'(xq)  -  0,  and  the  Taylor  expansion  gives 

U{x)  =  U{xq)+^U'\xq)x2 +  ©{x2).  (8.7) 

Around  a  minimum  in  the  potential  energy,  any  potential  energy  thus  resembles  that  of  a  harmonic  oscillator. 
Any  particle  placed  in  such  a  potential  energy  landscape  close  to  a  minimum  (i.e.,  a  particle  on  which  a  force 
acts  close  to  the  point  where  the  force  vanishes)  will  therefore  tend  to  oscillate.  By  comparing  equation  (8  ) 
with  the  potential  energy  of  the  harmonic  oscillator,  we  can  immediately  read  off  that  the  resulting  oscillatory 
motion  is  identical  to  that  of  a  harmonic  oscillator  with  spring  constant  k  =  U"  (xp).  A  particle  released  close 
to  a  minimum  of  the  potential  energy  will  thus  oscillate  with  a  frequency  m  =  \JU"  {xp)lm. 

8.2.  Damped  harmonic  oscillator 

So  far  we’ve  disregarded  damping  on  our  harmonic  oscillators,  which  is  of  course  not  very  realistic.  The  main 
source  of  damping  for  a  mass  on  a  spring  is  due  to  drag  of  the  mass  when  it  moves  through  air  (or  any  fluid, 
either  gas  or  liquid).  For  relatively  low  velocities,  drag  forces  on  an  object  scale  linearly  with  the  object’s 
velocity,  as  illustrated  by  Stokes’  law  (equation  2.1  ).  For  an  object  of  arbitrary  shape  moving  through  an 


Uhe  only  exception  being  functions  of  the  form  x2n  for  n>  1. 
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X(t) 


Figure  8.2:  Position  as  function  of  time  for  four  types  of  oscillation:  undamped  (£  =  0,  blue),  underdamped  (0  <  (  <  1,  orange),  critically 
damped  ((  =  1,  green)  and  overdamped  (£  >  1,  red).  In  all  cases  the  initial  conditions  are  x(0)  =  1  and  i/( 0)  =  0. 

arbitrary  fluid  we’ll  write  F^rag  =  -yx,  with  y  the  drag  coefficient,  and  of  course  opposing  the  direction  of 
motion.  Adding  this  to  the  spring  force  gives  for  the  equation  of  motion  of  the  damped  harmonic  oscillator: 

mx--yx-kx.  (8.8) 

We  now  have  two  numbers  that  determine  the  motion:  the  undamped  frequency  to o  =  Vklm  and  the  damp¬ 
ing  ratio  (  —  y/2V  ink.  In  terms  of  these  parameters,  we  can  rewrite  equation  (  )  as: 

x  +  2(tt>ox  +  u)qX  —  0.  (8.9) 

The  solution  of  equation  (8.9)  depends  strongly  on  the  value  of  ( ,  see  figure  8."  .  We  can  find  it2  by  substituting 
the  Ansatz  x(t)  =  eAt ,  which  gives  a  characteristic  equation  for  A: 

A2  +  2fw0A  +  o>q  =  0,  (8.10) 


so 

A=-Cw0±aioV/C2-l.  (8-11) 

For  C  <  1,  there  are  two  complex  solutions  for  A,  and  we  find  that  x(  t)  undergoes  an  oscillation  with  an  expo¬ 
nentially  decreasing  amplitude: 


x{t)-e  ^"°f[Acos(<Urff)  +  Bsin(Wdt)],  (8.12) 

where  co^  -  a>o  \/\  -  (2  and  A  and  B  follow  from  the  initial  conditions.  Because  there  is  still  an  oscillation,  this 
type  of  motion  is  called  underdamped.  In  contrast,  if  (  >  1,  the  roots  A+  in  equation  (  )  are  real,  and  we  get 

qualitatively  different,  overdamped  behavior,  in  which  x  returns  to  0  with  an  exponential  decay  without  any 
oscillations: 

x(f)  =  Ael+t  +  BeA  t  =  e_f"ot  [ Aent  +  Be~nt] ,  (8.13) 

where  Q  =  coq \J(2  -  1.  Naturally  the  boundary  case  is  when  (  —  1 ,  which  is  a  critically  damped  oscillator  -  the 
fastest  return  to  0  without  oscillations.  Because  in  this  case  equation  (8.1  )  only  has  one  root,  we  again  get  a 
qualitatively  different  solution: 

jc(f)  =  {A  +  Bt)e~Wot.  (8.14) 

The  three  different  cases  and  the  undamped  oscillation  are  shown  in  figure  3.2. 

2See  appendix  A.  3. 2  for  the  mathematical  details  on  howto  solve  general  equations  of  this  type. 
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8.3.  Driven  harmonic  oscillator 

A  mass  on  a  spring,  displaced  out  of  its  equilibrium  position,  will  oscillate  about  that  equilibrium  for  all 
time  if  undamped,  or  relax  towards  that  equilibrium  when  damped.  Its  amplitude  will  remain  constant  in 
the  first  case,  and  decrease  monotonically  in  the  second.  However,  if  we  give  the  mass  a  periodic  small 
push  at  the  right  moment  in  its  oscillation  cycle,  its  amplitude  can  increase,  and  even  diverge.  To  see  how 
this  works  we  study  the  driven  oscillator,  where  we  apply  a  periodic  driving  force  Fo(t)  =  Fd  cos(wd  t)  = 
|Fd  [eMot  +  e_1"[,/).  Adding  this  driving  force  to  the  equation  of  motion  (  )  of  a  damped  harmonic  os¬ 

cillator,  we  obtain: 

x  +  2(i)0(x  +  u)lx=  ^(eiWDf  +  e_'"Df).  (8.15) 

We  already  know  the  homogeneous  solution  to  equation  (8.1'  )  -  that’s  just  the  damped  oscillator  again,  so 
depending  on  the  value  of  (,  we  get  one  of  the  three  possible  solutions  of  the  previous  section.  To  find  a 
particular  solution,  we  first  note  that  we  can  split  the  driving  term  in  two  -  if  we  have  a  particular  solution 
for  each  of  the  oscillating  exponentials,  we  can  simply  add  them.  Also,  these  exponentials  themselves  look 
very  similar  to  the  underdamped  solutions,  so  they  may  make  a  good  guess  for  a  particular  solution.  For  a 
right-hand  side  of  ( Fnl2m)e±lWDt  we  therefore  try  xp  =  Ae±u"v‘l.  Substituting  this  into  equation  (  )  with 

the  appropriate  right-hand  side,  we  get: 

A(-(o^±2i(o0((Ou  +  a)l)e±iu>ot=^-e±ia,Dt,  (8.16) 

so  we  find  that  we  have  indeed  a  solution  if  the  amplitude  is  given  by 


A(w  d)  = 


_ Fd _ 

2m  («q  + 


(8.17) 


The  full  particular  solution  of  equation  (8.15)  is  then  given  by 


Fd 

Xplt)  = 

2  m 

_  Fd 
m 


0iu)Y)t 


e~ia)Y)t 


o)q  +  2io)o(ci)D  -  a)Q-2i(i)o(ct>D-io 

(<yp  -  ojI.)  cos(<dd  t)  +  2wo(o>d  sin(&>D  t ) 


Fd 


mR[ct)  d) 


(Wq  _  wd)2  +  4wo(2md 
COs(wDt-  0(Wd)) 


(8.18) 


where  the  factor  R(o>d)  in  the  amplitude  is  defined  by 

R2(wd)  =  («2-w2)2  +  4w2c2^,  (8.19) 

and  the  phase  (ji(ftio)  by  cos <p  =  (o)p  -  &i2  )/R(<u d),  sirup  -  2cjq(,(i>dI  R{(i>d),  so 

2(OqF,(i)d 

tan(0(wD))  =  ?  ,  .  (8.20) 


Resonance,  a  large  response  of  the  harmonic  oscillator  to  a  small  driving  force,  occurs  when  xp(r)  blows  up, 
or  !?(&)□)  g°es  to  zero.  That  does  not  always  happen,  but  R{o d)  can  reach  a  minimum  at  which  the  amplitude 
becomes  large: 


0  = 


dl?2 

dwD 


-4(Wq  -  (jp)(Jd  +  8wq(2cdd, 


(8.21) 


which  is  at 

.  .2  _  .  ,2  2/2 


(8.22) 


or  Wq  -  o)(t  if  the  damping  factor  (  is  small.  Note  that  in  this  same  limit  (small  f),  we  find  that  when  Wq  -  o)n, 
tarup  oo,  so  <p  n 1 2.  Therefore,  in  this  case  the  driving  happens  out  of  phase  with  the  response,  that  is  to 
say,  you  push  hardest  when  the  mass  is  at  its  point  of  maximum  speed,  increasing  that  speed  even  further, 
and  leading  to  an  increase  in  amplitude.  In  practice,  this  is  what  kids  do  when  they  sit  on  a  swing:  they  fling 
back  their  legs  when  they  go  through  the  lowest  point  (maximum  speed)  going  backwards,  and  fling  their  legs 
forward  at  the  same  point  when  going  forwards,  increasing  their  speed  and  thus  amplitude. 
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Figure  8.3:  Motion  of  two  coupled  pendulums,  (a)  Sketch  of  the  setup.  Two  identical  pendulums  of  length  L  and  mass  m  are  con¬ 
nected  through  a  weak  spring  of  spring  constant  k.  As  our  initial  condition  we  choose  both  pendulums  at  rest,  with  the  right  one  in  its 
equilibrium  position  and  the  left  one  given  a  finite  amplitude,  (b)  Resulting  motion  of  the  two  pendulums:  left  (blue)  and  right  (orange). 


8.4.  Coupled  oscillators 

8.4.1. 

A  beautiful  demonstration  of  how  energy  can  be  transferred  from  one  oscillator  to  another  is  provided  by  two 
weakly  coupled  pendulums.  Imagine  we  have  two  identical  pendulums  of  length  L  and  mass  m,  which  are 
connected  by  a  weak  spring  with  spring  constant  k  (figure  8  a).  The  equation  of  motion  of  the  combined 
system  is  then  given  by: 


LQ\  =  -gsindi  -  fcL(sin0i  -  sin02),  (8.23a) 

L6 2  =  -g sin02  +  fcL(sin0i  -  sinful-  (8.23b) 


We  will  once  again  use  the  small  angle  expansion  in  which  we  can  approximate  sin0  ~  0,  and  identify  ojq  - 
yj gIL  as  the  frequency  of  each  of  the  (uncoupled)  pendulums.  Equations  (  )  then  become 

6\  =  -co^di  -  kOi  +  k02,  (8.24a) 

62- -(^o02  + kd\- k02-  (8.24b) 

We  can  solve  the  system  of  coupled  equations  (8.24)  easily  by  introducing  two  new  variables:  a -61  +  62  and 
P  -  61-62,  which  gives  us  two  uncoupled  equations: 

a  -  -(Oq/x,  (8.25a) 

P=-(D20p-2kp  =  -((D')2pt  (8.25b) 

where  (o/)2  =  of2  +  2k  or  (o'  =  \J2 k+  gIL.  Since  equations  (  )  and  (  )  are  simply  the  equations  of 

harmonic  oscillators,  we  can  write  down  their  solutions  immediately: 


a(f)  =  Acos{a>ot  +  (f>o),  (8.26a) 

jS(t)  =  8cos(w'f  +  0').  (8.26b) 

Converting  back  to  the  original  variables  6 1  and  O2  is  also  straightforward,  and  gives 

1  A  B 

9i  —  -  (a  +  p)  =  —  cos(«o  t  +  (po)  +  —  cos(o)  t  +  <p),  (8.27a) 

1  A  B 

62  =  -  (a  —  p)  =  —  cos(«of  +  0o)  -  —  cos(cir  t  +  <p  ).  (8.27b) 

Let’s  put  in  some  specific  initial  conditions:  we  leave  pendulum  number  2  at  rest  in  its  equilibrium  posi¬ 
tion  (02(0)  =  02(0)  =  0  and  give  pendulum  number  1  a  finite  amplitude  but  also  release  it  at  rest  (0i(O)  =  0(l, 
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01  (0)  =  0).  Working  out  the  four  unknowns  (A,  B,  (po  and  </>')  is  straightforward,  and  we  get: 

„  00  00  ,  _  (  + <l>'  )  ( (I) 0  —  CO1  \ 

01  =  —  cos(wot)  +  —  cos (w  t)  =  0o  cos  I  — - —  1 1  cos  I  — - —  1 1 ,  (8.28a) 

„  00  00  ,  „  ( Wo  +  Cl/  1  id)'  —  Wo  1 

02  —  —  cos(wot)  -  —  cos(w  f)  =  0o  sml  — — —  f  I  sml  — - —  f  I .  (8.28b) 

The  solution  given  by  equations  (8.28)  is  plotted  in  figure  8.3b.  Note  that  the  solutions  have  two  frequencies 
(known  as  the  eigenfrequencies  of  the  system) .  The  fast  one,  |  (wo + w') ,  which  for  a  weak  coupling  constant  k  is 
very  close  to  the  eigenfrequency  wo  of  a  single  pendulum,  is  the  frequency  at  which  the  pendulums  oscillate. 
The  do  so  in  anti-phase,  as  expressed  mathematically  by  the  fact  that  one  oscillation  has  a  sine  and  the  other 
a  cosine  (which  is  of  course  just  a  sine  shifted  over  n/2).  The  second  frequency,  i(w'-wo)  is  much  slower,  and 
represents  the  frequency  at  which  the  two  pendulums  transfer  energy  to  each  other,  through  the  spring  that 
couples  them.  In  figure  8.3b,  it  is  the  frequency  of  the  envelope  of  the  amplitude  of  the  oscillation  of  either  of 
the  pendulums.  All  these  phenomena  will  return  in  the  next  section,  in  the  study  of  waves,  which  travel  in  a 
medium  in  which  many  oscillators  are  coupled  to  one  another. 


8.4.2. 

For  a  system  with  only  two  oscillators,  the  technique  we  used  above  for  solving  the  system  of  coupled  equa¬ 
tions  (8.2  )  is  straightforward.  It  does  however  not  generalize  easily  to  systems  with  many  oscillators.  Instead, 
we  can  exploit  the  fact  that  the  equations  are  linear  and  use  techniques  from  linear  algebra  (as  you  may  have 
guessed  from  the  term  eigenfrequency).  We  can  rewrite  equations  (8.2:  )  in  matrix  form: 


_dM0i  j  (-(wg  +  fc)  k  U0i! 

df2  {  02  )  {  k  -(w l  +  k)  )  (  02  )' 


(8.29) 


Equation  (8.2  )  is  a  homogeneous,  second-order  differential  equation  with  constant  coefficients,  strongly 
resembling  the  equation  for  a  simple,  one-dimensional  harmonic  oscillator.  Consequently,  we  may  expect 
the  solutions  to  look  similar  as  well,  so  we  try  our  usual  Ansatz: 


0i 

02 


where  Ci  and  C2  are  constants.  Substituting  (8.3  )  in  (8.  )  gives 


o)q  +  k  -k 
-k  ct>Q+k 


Ci 

C2 


Ci 

C2 


which  you  hopefully  recognize  as  an  eigenvalue  problem.  Solving  for  the  eigenvalues  o>2  gives: 

(-i o 2  +  £1*0  +  k)2  -  k2  -  0. 


(8.30) 


(8.31) 


(8.32) 


The  solutions  of  equation  (8.3:  )  unsurprisingly  reproduce  the  frequencies  of  the  uncoupled  equations  in 
section  8.4.1: 

=  dJg,  (t)2  =  tog  +  21c.  (8.33) 

The  eigenvectors  of  (8.3  )  are  given  by 

C+  =  -^=  |  |  |  for  a) +  and  C  =  |  ^  j  for  w_.  (8.34) 

Note  that  the  eigenvectors  are  orthogonal;  this  is  a  general  property  of  the  eigenvectors  of  symmetric  matri¬ 
ces.  Each  eigenvector  corresponds  to  a  possible  steady-state  of  motion  of  the  system;  these  states  are  known 
as  the  normal  modes  (‘normal’  referring  to  the  orthogonality  of  the  eigenvectors).  We  can  now  immediately 
write  down  the  most  general  solution  of  equation  (  )  as  a  linear  combination  of  the  eigenmodes: 

(8-35> 


where  the  amplitudes  A+  and  phases  (p+  are  determined  by  the  initial  conditions. 
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Writing  our  system  of  equations  in  matrix  form  allows  us  to  easily  generalize  both  to  asymmetric  con¬ 
figurations  (see  problem  8.4.  )  and  to  systems  with  many  coupled  oscillators.  An  important  example  of  the 
latter  case  is  the  study  of  vibrations  in  solids.  Atoms  or  ions  in  solids  typically  form  a  crystal  lattice,  that  can 
be  modeled  as  a  large  number  of  masses  coupled  by  springs.  Such  crystals  can  have  complicated  vibrational 
properties,  that  can  be  analyzed  in  terms  of  its  normal  modes.  In  particular  the  modes  with  a  low  energy  can 
typically  be  accessed  easily.  They  are  known  as  phonons,  and  correspond  to  sound  waves  in  the  solid. 
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8.5.  Problems 

8.1  An  object  undergoes  simple  harmonic  motion  of  amplitude  A  and  angular  frequency  a>  about  the  equi¬ 
librium  point  x  =  0.  Find  the  speed  v  of  the  object  in  terms  of  A,  w,  and  x.  Hint :  use  conservation  of 
energy. 

8.2  A  disk  of  radius  R  and  mass  M  is  suspended  from  a  pivot  somewhere  between  its  center  and  its  edge,  see 
figure  below.  For  what  pivot  point  (i.e.,  which  distance  d)  will  the  period  of  this  physical  pendulum  be  a 
minimum  (or  equivalently  the  frequency  a  maximum)?  You  may  find  one  of  the  theorems  we  proved  in 
chapter  5  useful  in  answering  this  question. 


8.3  Figure  8.4  shows  a  common  present-day  seesaw  design,  also  featured  in  problem  2. 1C .  In  addition  to 
a  beam  with  two  seats,  this  seesaw  also  contains  two  identical  springs  (with  spring  constant  10  kN/m) 
that  connect  the  beam  to  the  ground.  The  distance  between  the  pivot  and  each  of  the  springs  is  30.0  cm, 
the  distance  between  the  pivot  and  each  of  the  seats  is  1.50  m.  Two  children  sit  on  the  two  seats.  Both 


Figure  8.4:  Seesaw  with  two  springs. 


children  kick  off  against  the  ground  a  couple  of  time,  putting  the  seesaw  in  an  oscillating  motion  with  an 
amplitude  of  50.0  cm.  At  t  =  0  the  children  stop  kicking.  The  plot  in  figure  c  shows  the  height  of  one  of 
the  seats  as  a  function  of  time  afterwards. 

(a)  In  what  type  of  motion  is  the  seesaw  after  the  children  stop  kicking? 

(b)  You  could  model  the  seesaw  with  the  two  children  as  a  simple  mass-on-a-spring,  with  a  spring  con¬ 
stant  twice  that  of  the  individual  spring  in  the  seesaw.  Using  the  graph  in  figure  c,  estimate  the  effec¬ 
tive  mass  of  this  system. 

(c)  After  a  while,  the  children  resume  kicking,  slowly  bringing  their  amplitude  back  up  to  50.0  cm.  Using 
the  mass-spring  system  of  (b),  estimate  the  amount  of  energy  the  children  have  to  put  in  per  period 
to  achieve  this. 

8.4  A  block  with  mass  mi  -  1.5  kg  block  is  supported  by  a  frictionless  surface  and  attached  to  a  horizontal 
spring  of  constant  k-  22  N/m,  as  shown  in  the  figure.  The  block  oscillates  with  an  amplitude  of  10.0  cm, 
executing  a  simple  harmonic  motion. 
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(a)  Find  the  frequency  to  of  the  oscillation  of  the  block. 

(b)  Write  down  the  equation  for  the  position  of  the  block  as  a  function  of  time,  x(r),  in  such  a  form  that 
it  is  at  its  rightmost  position  at  t  —  0. 

A  second  block  of  mass  0.80  kg  moves  in  from  the  right  at  2.5  m/s  and  hits  the  first  block  at  t  =  0,  i.e., 
when  it  is  in  its  rightmost  position.  The  two  blocks  then  stick  together  and  continue  moving  as  one. 

(a)  Which  quantity  /  quantities  are  conserved  during  the  collision? 

(b)  Determine  the  frequency  of  the  motion  of  the  two  blocks  after  the  collision. 

(c)  Determine  the  amplitude  of  the  motion  of  the  two  blocks  after  the  collision. 

8.5  Suppose  you  are  stranded  on  an  unknown  planet  with  nothing  but  a  physical  pendulum  and  a  stopwatch. 
You  determined  the  properties  of  the  pendulum  back  on  Earth,  and  found  m  =  2.0  kg,  h  =  0.50  m  and 
I  -  3.0  kg-  m2.  Having  nothing  better  to  do,  you  measure  the  time  it  takes  your  pendulum  to  complete  50 
cycles,  and  find  that  this  time  equals  170  s.  Use  this  information  to  compute  the  value  of  the  gravitational 
acceleration  g  on  your  new  home  world. 

8.6  For  a  damped  harmonic  oscillator  driven  by  a  sinusoidal  force  (as  in  equation  ),  find  the  average 
power  dissipated  per  (driving)  period.  Hint :  use  P  -  F  •  v. 

8.7  Consider  a  system  of  two  coupled  harmonic  oscillators,  where  one  (with  mass  2m  and  spring  constant  2k) 
is  suspended  from  the  ceiling,  and  the  other  (with  mass  m  and  spring  constant  k)  is  suspended  from  the 
first,  as  shown  in  the  figure. 

(a)  Find  the  equation  of  motion  of  this  system  of  coupled  oscillators,  and  write  it  in  matrix  form.  For 
each  mass,  use  coordinates  in  which  the  zero  is  at  the  equilibrium  position. 

(b)  Find  the  frequencies  of  the  normal  modes  of  this  coupled  system. 


Waves 


In  physics  a  wave  is  a  disturbance  or  oscillation  that  travels  through  space  accompanied  by  a  transfer  of 
energy,  and  may  be  propagated  with  little  or  no  net  motion  of  the  medium  involved.  In  this  section  we  will 
consider  mechanical  waves,  in  which  the  particles  in  a  material  are  oscillating.  Examples  are  the  waves  in 
the  sea,  the  wave  in  the  crowd  at  a  stadium,  and  sound.  Later  on  we  will  encounter  electromagnetic  waves  in 
which  electric  and  magnetic  fields  are  oscillating,  and  which  can  travel  through  vacuum.  Examples  are  light 
and  radio  signals.  In  quantum  mechanics,  we  will  also  encounter  what  are  sometimes  referred  to  as  matter 
waves,  where  fundamental  objects  that  we  usually  think  of  as  particles,  such  as  electrons  and  protons,  can 
also  be  considered  as  waves.  Finally,  recently  gravitational  waves  were  discovered,  which  are  vibrations  of 
spacetime  itself. 

By  observing  a  particle,  we  know  in  which  direction  it  moves  at  any  given  time.  However,  as  I  just  stated, 
the  particles  in  a  mechanical  wave  have  no,  or  almost  no,  net  motion  as  the  wave  passes.  The  wave  does 
have  a  well-defined  direction  though:  the  direction  in  which  energy  is  transferred.  Some  waves  spread  out 
uniformly,  such  as  a  sound  wave  emanating  from  a  point  source.  Others  are  restricted  in  their  motion  by  the 
properties  of  the  material  they  travel  in,  such  as  a  wave  in  a  string,  or  by  boundary  conditions,  such  as  the 
end  of  that  string.  For  waves  that  move  (predominantly)  in  one  direction,  we  can  distinguish  two  fundamental 
types,  illustrated  in  figure  .  The  first  type  is  the  case  that  the  particles  oscillate  in  the  same  direction  as  the 
wave  is  moving  (figure  a),  which  we  call  a  longitudinal  wave;  sound  is  an  example.  The  second  case  is 
that  the  particles  oscillate  in  a  direction  perpendicular  to  the  wave  motion,  which  we  call  a  transverse  wave 
(figure  9.  b),  of  which  the  waves  in  a  pond  are  an  example. 

9.1.  Sinusoidal  waves 

Probably  the  simplest  kind  of  wave  is  a  transverse  sinusoidal  wave  in  a  one-dimensional  string.  In  such  a  wave 
each  point  of  the  string  undergoes  a  harmonic  oscillation.  We  will  call  the  displacement  from  equilibrium  u, 
then  we  can  plot  u  as  a  function  of  position  on  the  string  at  a  given  point  in  time,  figure  9.2a,  which  is  a 
snapshot  of  the  wave.  Alternatively,  we  can  plot  u  as  a  function  of  time  for  a  given  point  (with  given  position) 
on  the  string,  figure  9.2b.  Because  the  oscillation  is  harmonic,  the  displacement  as  a  function  of  time  is  a  sine 
function,  with  an  amplitude  (maximum  displacement)  A  and  a  period  (time  between  maxima)  T. 

By  definition,  each  point  of  the  string  undergoing  a  sinusoidal  wave  undergoes  a  harmonic  oscillation, 
so  for  each  point  we  can  write  u{t)  =  Acos(wr+  <p)  (equation  ),  where  as  before  w  -  2ttIT  is  the  (angular) 
frequency  and  <p  the  phase.  Two  neighboring  points  on  the  string  are  slightly  out  of  phase  -  if  the  wave  is 
traveling  to  the  right,  then  your  right-hand  neighbor  will  reach  maximum  slightly  later  than  you,  and  thus  has 
a  slightly  larger  phase.  The  difference  in  phase  is  directly  proportional  to  the  distance  between  two  points, 
coming  to  a  full  2 n  (which  is  of  course  equivalent  to  zero)  after  a  distance  A,  the  wavelength.  The  wavelength 
is  thus  the  distance  between  two  successive  points  with  the  same  phase,  in  particular  between  two  maxima. 
In  between  these  maxima,  the  phase  runs  over  the  full  2 n,  so  the  wave  is  also  a  sinusoid  in  space,  with  a 
‘spatial  frequency’  or  wavenumber  k  -  2nl  A.  Combining  the  dependencies  on  space  and  time  in  a  single 
expression,  we  can  write  for  the  sinusoidal  wave: 

u[x,  t)  —  Acos[kx- cjt).  (9.1) 
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Figure  9.1:  Two  basic  types  of  waves,  (a)  Longitudinal  wave,  where  the  oscillatory  motion  of  the  particles  is  in  the  same  direction  as  that 
of  the  wave,  (b)  Transverse  wave,  where  the  oscillatory  motion  of  the  particles  is  perpendicular  to  that  of  the  wave. 


The  speed  of  the  wave  is  the  distance  the  wave  travels  per  unit  time.  A  unit  time  for  a  wave  is  one  period, 
as  that  is  the  time  it  takes  the  oscillation  to  return  to  its  original  point.  The  distance  traveled  in  one  period  is 
one  wavelength,  as  that  is  the  distance  between  two  maxima.  The  speed  is  therefore  simply  their  ratio,  which 
can  also  be  expressed  in  terms  of  the  wave  number  and  frequency: 


^w  = 


A 

T 


co 

k 


(9.2) 


9.2.  IE  WAVE  EQUATION 

As  with  all  phenomena  in  classical  mechanics,  the  motion  of  the  particles  in  a  wave,  for  instance  the  masses 
on  springs  in  figure  9.1,  are  governed  by  Newton’s  laws  of  motion  and  the  various  force  laws.  In  this  section 
we  will  use  these  laws  to  derive  an  equation  of  motion  for  the  wave  itself,  which  applies  quite  generally  to 
wave  phenomena.  To  do  so,  consider  a  series  of  particles  of  equal  mass  m  connected  by  springs  of  spring 
constant  k,  again  as  in  figure  9.1a,  and  assume  that  at  rest  the  distance  between  any  two  masses  is  h.  Let  the 
position  of  particle  i  be  x,  and  u  the  distance  that  particle  is  away  from  its  rest  position;  then  u  =  xrest  -  x  is 
a  function  of  both  position  x  and  time  t.  Suppose  particle  i  has  moved  to  the  left,  then  it  will  feel  a  restoring 
force  to  the  right  due  to  two  sources:  the  compressed  spring  on  its  left,  and  the  extended  spring  on  its  right. 
The  total  force  to  the  right  is  then  given  by: 

Pi  =  Pi+i^i  ~  Fi-i^i 

-  k  [u(x+  h,  t)  -  u[x,  t )]  -  k  [u{x,  t )  -  u(x-  h,  ij] 

=  k  [m(x+  h,  t)  -2u[x,  t)  +  u{x-  h,  t)]  (9.3) 


Equation  ( 1.3)  gives  the  net  force  on  particle  i,  which  by  Newton’s  second  law  of  motion  (equation  2.5)  equals 
the  particle’s  mass  times  its  acceleration.  The  acceleration  is  the  second  time  derivative  of  the  position  x, 
but  since  the  equilibrium  position  is  a  constant,  it  is  also  the  second  time  derivative  of  the  distance  from  the 
equilibrium  position  u{x,  t),  and  we  have: 


Tnet  — 


m 


d2u{x,  t) 
dt 2 


k[u(x+  h,  t)  -2u[x,  t )  +  u{x-  h,  f)] . 


(9.4) 


Equation  (9.-  )  holds  for  particle  i,  but  just  as  well  for  particle  i  +  1,  or  i  -  10.  We  can  get  an  equation  for  N 
particles  by  simply  adding  their  individual  equations,  which  we  can  do  because  these  equations  are  linear. 
We  thus  find  for  a  string  of  particles  of  length  L  —  Nh  and  total  mass  M  —  Nnr. 


d2u{x,  t) 
dt2 


KL2 


u{x+  h,  t)  -  2 u(x,  t)  +  u{x -  h,  t) 

h2 


M 


(9.5) 
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Figure  9.2:  A  sinusoidal  transverse  wave  in  space  (a)  and  time  (b).  The  distance  between  two  successive  maxima  (or  any  two  successive 
points  with  equal  phase  is  the  wavelength  A  of  the  wave.  The  maximum  displacement  is  the  amplitude  A,  and  the  time  it  takes  a  single 
point  to  go  through  a  full  oscillation  is  the  period  T. 


Here  K  —  kl  N  is  the  effective  spring  constant  of  the  N  springs  in  series.  Now  take  a  close  look  at  the  fraction 
on  the  right  hand  side  of  equation  (9.5):  if  we  take  the  limit  h  —> •  0,  this  is  the  second  derivative  of  u(x,  t)  with 
respect  to  x.  However,  taking  h  to  zero  also  takes  L  to  zero  -  which  we  can  counteract  by  simultaneously  taking 
IV  — *  oo,  in  such  a  way  that  their  product  L  remains  the  same.  What  we  end  up  with  is  a  string  of  infinitely 
many  particles  connected  by  infinitely  many  springs  -  so  a  continuum  of  particles  and  springs,  for  which  the 
equation  of  motion  is  given  by  the  wave  equation: 

d2u{x,t )  0d2u{x,t) 

~^-V-^x^-  (9‘6) 

In  equation  (  ),  vw  -  V KL2IM  (sometimes  also  denoted  by  c)  is  the  wave  speed. 

For  a  wave  in  a  taut  string,  the  one-dimensional  description  is  accurate,  and  we  can  relate  our  quantities 
K,  L  and  M  to  more  familiar  properties  of  the  string:  its  tension  T  —  KL  with  the  dimension  of  a  force  (this  is 
simply  Hooke’s  law  again)  and  its  mass  per  unit  length  p-  MIL,  so  we  get 


^string  — 


(9.7) 


In  two  or  three  dimensions,  the  spatial  derivative  in  equation  (9.6)  becomes  a  Laplacian  operator,  and  the 
wave  equation  is  given  by: 


d2u{x,  t ) 
dt2 


i/^V2  u{x,  t ) 


(9.8) 


As  can  be  readily  seen  by  writing  equation  (9.8)  in  terms  of  spherical  coordinates,  if  the  wave  is  radial 
(i.e.,  only  depends  on  the  distance  to  the  source  r,  and  not  on  the  angle),  the  quantity  r  u(r)  obeys  the  one¬ 
dimensional  wave  equation,  so  we  can  write  down  the  equation  for  u[r )  immediately.  An  important  applica¬ 
tion  are  sound  waves,  which  spread  uniformly  in  a  uniform  medium.  To  find  their  speed,  we  characterize  the 
medium  in  a  similar  fashion  as  we  did  for  the  string:  we  take  the  mass  per  unit  volume,  which  is  simply  the 
density  p,  and  the  medium’s  bulk  modulus,  which  is  a  measure  for  the  medium’s  resistance  to  compression 
(i.e.,  a  kind  of  three-dimensional  analog  of  the  spring  constant),  defined  as: 


dp  dp 
dV  ^  dp 


(9.9) 


where  p  is  the  pressure  (force  per  unit  area)  and  V  the  volume.  The  bulk  modulus  is  also  sometimes  denoted 
as  K.  The  dimensions  of  the  bulk  modulus  are  those  of  a  pressure,  or  force  per  unit  area,  and  those  of  the 
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density  are  mass  per  unit  volume,  so  their  ratio  has  the  dimension  of  a  speed  squared,  and  the  speed  of 
sound  is  given  by: 


^sound  — 


(9.10) 


Equation  (9.8)  describes  a  wave  characterized  by  a  one-dimensional  displacement  (either  longitudinal 
or  transverse)  in  three  dimensions.  In  general  a  wave  can  have  components  of  both,  and  the  displacement 
itself  becomes  a  vector  quantity,  u{x,  t).  In  that  case  the  three-dimensional  wave  equation  takes  on  a  more 
complex  form: 

d2u{x,t)  4 

p - f—  =  /+(£+  -  G)V(V  •  u{x,  t))  -  GV  x  (V  x  u{x,  f)),  (9.11) 

dt 1  3 

where  /  is  the  driving  force  (per  unit  volume),  B  again  the  bulk  modulus,  and  G  the  material’s  shear  modulus. 
Equation  (9.11)  is  used  for  the  description  of  seismic  waves  in  the  Earth  and  the  ultrasonic  waves  with  which 
solid  materials  are  probed  for  defects. 


9.3.  Solution  of  the  one-dimensional  wave  equation 

The  one-dimensional  wave  equation  (  )  has  a  surprisingly  generic  solution,  due  to  the  fact  that  it  contains 

second  derivatives  in  both  space  and  time.  As  you  can  readily  see  by  inspection,  the  function  q[x,  t)  —  x—  vwt 
is  a  solution,  as  is  the  same  function  with  a  plus  instead  of  a  minus  sign.  These  functions  represent  waves 
traveling  to  the  right  (minus)  or  left  (plus)  at  speed  vw.  Elowever,  the  shape  of  the  wave  does  not  matter  -  any 
function  F{q)  =  Fix  -  vw  t)  is  a  solution  of  (  .6),  as  is  any  function  G(x+  vwt),  and  the  general  solution  is  the 
sum  of  these: 

u{x,t)  -  F[x- vwt)  +  G(x+ vwt).  (9.12) 

To  find  a  specific  solution,  we  need  to  look  at  the  initial  conditions  of  the  wave,  i.e.,  the  conditions  at  t  -  0. 
Because  the  wave  equation  is  second  order  in  time,  we  need  to  specify  both  the  initial  displacement  and  the 
displacement’s  initial  velocity,  which  can  be  functions  of  the  position.  For  the  most  general  case  we  write: 

u{x,  0)  =  uq{x),  (9.13) 

ii[x,  0)  =  v0{x).  (9-14) 

The  resulting  solution  of  the  one-dimensional  wave  equation  is  known  as  d'Alembert’s  equation : 


9 


u{x,  t)  =  -  (uq{x- vwt)  +  uq{x+  vwt))  + -  /  v{y)dy.  (9.15) 

2  2  vw  Jx-  vw  t 

9.4.  Wave  superposition 

The  wave  equation  (  .6)  is  linear  in  the  function  we’re  interested  in,  the  displacement  u(x,  t).  This  simple 
mathematical  statement  has  important  consequences,  because  it  means  that  if  we  know  any  set  of  solutions, 
we  can  create  more  solutions  by  making  linear  combinations  of  them  -  so  if  u\  [x,  t )  and  «2  (x,  t)  are  solutions, 
then  so  are  a  u\  [x,  t)  +  b  112  [x,  t)  for  any  choice  of  a  and  b.  In  physics,  this  useful  property  of  linear  differential 
equations  is  known  as  the  principle  of  superposition.  Thanks  to  this  principle,  we  can  study  how  different 
waves  interact  with  each  other  without  having  to  do  (much)  extra  math. 

To  illustrate,  let  us  consider  two  one-dimensional  waves  traveling  in  opposite  directions,  figure  .  As 
long  as  the  waves  do  not  overlap,  the  oscillation  of  any  given  particle  is  due  to  only  one  wave,  and  there  is  no 
interaction.  Elowever,  as  soon  as  the  waves  start  overlapping,  the  oscillations  add  up,  which  leads  to  interfer¬ 
ence.  At  some  points,  the  two  oscillations  will  be  in  phase,  resulting  in  a  much  larger  oscillation  amplitude, 
which  we  call  constructive  interference  (figure  3b).  At  other  points,  the  two  oscillations  will  be  out  of  phase, 
resulting  in  a  much  smaller,  or  even  vanishing  oscillation  amplitude,  which  we  call  destructive  interference 
(figure  9.3c).  However,  the  waves  themselves  remain  unaffected,  and  transmit  right  through  each  other,  con¬ 
tinuing  their  path  as  if  nothing  had  happened  (figure  3.3d). 

Waves  reaching  the  end  of  a  string,  or  edge  of  a  pond,  or  any  type  of  boundary,  will  not  simply  disap¬ 
pear.  Remember,  waves  carry  energy,  and  that  energy  is  conserved,  so  it  has  to  go  somewhere  once  the  wave 
reaches  the  boundary.  If  there’s  nothing  at  the  boundary,  the  waves  are  reflected  back  into  the  material.  This 
happens  in  two  cases:  a  (perfectly)  fixed  boundary,  and  a  (perfectly)  free  boundary;  in  other  cases,  some  of 
the  energy  may  be  transmitted  to  material  on  the  other  side  of  the  boundary  (starting  a  new  wave  there), 
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Figure  9.3:  Two  interacting  wave  packets.  The  sequence  of  images  shows  four  snapshots.  The  blue  wave  is  traveling  to  the  right,  the  red 
wave  to  the  left  (a).  When  the  waves  overlap,  the  total  displacement  of  the  particle  is  given  by  the  sum  of  the  displacements  due  to  both 
waves,  shown  in  green.  This  can  lead  to  both  constructive  interference  (b),  when  the  two  waves  are  in  phase,  and  destructive  interference 
(c),  when  their  phases  are  opposite.  The  waves  themselves  are  not  affected  by  the  interaction  and  afterwards  travel  on  as  if  nothing  has 
happened  (d). 
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whereas  the  remainder  is  reflected  back  into  the  original  material  with  less  energy,  resulting  in  a  smaller  wave 
amplitude.  A  reflected  wave  travels  in  the  opposite  direction  to  the  original  wave,  so  it  can  interfere  with  itself. 
In  fact,  this  interference  is  a  crucial  point  for  being  able  to  meet  the  boundary  conditions.  A  fixed  boundary 
cannot  move,  so  there  must  be  destructive  interference  keeping  the  amplitude  there  zero  at  all  times  -  so  it 
follows  that  a  wave  reflecting  on  a  fixed  boundary  undergoes  a  n  phase  shift.  Free  boundaries  on  the  other 
hand  are  perfectly  free  to  move,  so  there  is  nothing  holding  it  back  from  reaching  the  maximal  displacement 
that  can  be  achieved  by  constructive  interference,  and  the  wave  reflects  without  a  phase  shift. 

If  you  put  boundaries  on  both  ends  of  a  string,  the  wave  keeps  reflecting  back  and  forth,  continuously  in¬ 
terfering  with  itself.  To  find  the  resulting  shape  of  the  string,  we’re  going  to  use  the  principle  of  superposition 
for  a  simple  sinusoidal  wave.  Let  tq(x,  t)  -  A  cos(/cx  -  cot)  be  the  part  of  the  wave  traveling  to  the  right,  and 
U2[x,  t)  =  —  A  cos  ( lex o-cot)  be  the  part  traveling  to  the  left.  Note  the  differences:  the  waves  have  opposite  signs 
for  their  speeds,  and  opposite  signs  for  their  displacements,  the  latter  because  of  the  n  phase  shift  (we  could 
also  write  112  (x,  t )  =  Acos(kx  +  cot  +  n)).  The  shape  of  the  string  is  now  simply  the  sum  of  these  two  waves: 

u{x,  t)  =  u\{x,  t)  +  U2[x,  t)  —  A  [cosffcx-  tot)  -  cos(A;x  +  cot)] 

=  2Asin(fcx)  sin(wf).  (9.16) 

Equation  (  )  tells  us  that  for  a  self-interfering  wave,  the  wave  no  longer  moves  -  instead,  each  point  simply 

oscillates  with  frequency  to  at  a  position-dependent  amplitude  2  A  sin  (/ex).  We  call  such  a  wave  a  standing 
wave.  Standing  waves  are  very  common  -  you’ll  get  one  every  time  you’ll  touch  the  string  of  a  guitar  or  violin. 
Naturally,  they  are  not  restricted  to  one-dimensional  systems  -  the  skin  of  a  drum,  constrained  at  the  drum’s 
edge,  is  put  in  a  standing  wave  every  time  someone  hits  it. 

Equation  (9.1  )  describes  the  shape  of  a  standing  wave  on  a  string  clamped  at  both  ends.  If  the  string 
has  length  L,  then  by  the  nature  of  the  boundary  conditions,  we  must  have  u{ 0,  t)  =  u{L,t)  —  0  for  all  t.  The 
first  condition  follows  for  free  (which  is  of  course  just  due  to  a  good  choice  of  coordinates),  but  the  second 
puts  a  constraint  on  our  wave.  The  displacement  can  only  be  zero  at  all  times  if  the  amplitude  is  identically 
zero,  so  we  demand  that  sin(fcL)  =  0,  or  L  —  mtilk  -  mA.12,  where  m  is  any  positive  integer.  There  are  thus 
infinitely  many  allowed  standing  waves,  but  they  are  characterized  by  a  discrete  number.  The  allowed  waves 
are  known  as  modes,  and  the  associated  number  m  is  the  mode  number.  The  simplest  wave,  with  the  lowest 
possible  value,  m  -  1,  is  known  as  the  fundamental  mode.  In  the  fundamental  mode,  the  oscillation  of  the 
string  has  nonzero  amplitude  everywhere  but  at  the  fixed  ends;  for  higher  modes,  there  are  also  points  in 
between  that  have  zero  amplitude,  which  are  known  as  nodes;  points  where  the  amplitude  is  maximum  are 
sometimes  referred  to  as  antinodes. 

A  discrete  spectrum  of  allowed  solutions,  characterized  by  integer  numbers,  does  not  only  appear  in 
standing  mechanical  waves,  but  is  also  a  fundamental  aspect  of  quantum  mechanics. 


9.5.  Amplitude  modulation 

So  far,  we’ve  mostly  considered  simple  sinusoidal  waves  with  fixed  amplitudes.  However,  the  general  solution 
to  the  wave  equation  allows  for  many  more  interesting  wave  shapes.  An  important,  and  often  encountered 
one  is  where  the  wave  itself  is  used  as  the  medium,  by  changing  the  amplitude  over  time: 

u[x,  t)  =  A(x,  t)cos[kx- cot).  (9.17) 

The  wave  now  consists  of  two  waves:  the  carrier  wave,  which  travels  with  the  phase  velocity  vw  =  to  Ik,  and  the 
envelope,  which  travels  with  the  group  velocity  vg.  An  illustration  of  a  modulated  wave  is  shown  in  figure  9.5. 
In  the  common  case  that  the  group  velocity  is  independent  of  the  wavelength  of  the  carrier  wave,  we  can 
rewrite  (  )  to  reflect  the  fact  that  the  amplitude  is  now  also  a  wave,  with  speed  vg: 

u{x,  t)  =  A(x-  vgt)cos(kx- tot).  (9.18) 


9.6.  Sound  waves 

So  far,  we  mostly  considered  transversal  waves,  which  include  waves  in  strings  and  waves  on  the  surface  of 
a  pond,  and  are  easily  visualized.  Longitudinal  waves,  on  the  other  hand,  are  somewhat  harder  to  draw,  but 
easily  heard  -  as  sound  is  the  prime  example  of  a  longitudinal  wave.  Other  examples  include  (some  forms 
of)  seismic  waves  and  ultrasound.  Many  people  simply  lump  all  of  these  together,  and  use  the  terms  ‘sound 
waves’  and  ‘longitudinal  waves’  as  synonyms. 


9.7.  The  Doppler  effect 


99 


n  =  1 


n  =  2 


n  =  3 


Figure  9.4:  Snapshots  of  the  wave  patterns  for  the  first  three  modes  of  a  standing  wave  in  (a)  a  string  with  two  clamped  ends,  (b)  a  string 
with  one  clamped  end,  or  a  tube  with  one  closed  and  one  open  end,  and  (c)  a  tube  with  two  open  ends.  In  each  case,  the  plot  shows 
the  amplitude  of  the  standing  wave,  as  a  fraction  of  the  amplitude  of  the  traveling  wave.  At  nodes,  the  amplitude  of  the  standing  wave 
vanishes:  destructive  interference  causes  these  points  to  always  stand  still.  At  antinodes,  the  amplitude  of  the  standing  wave  is  maximal, 
and  equals  that  of  the  traveling  wave.  Note  that  (a)  and  (b)  will  occur  for  transversal  waves,  whereas  (b)  and  (c)  will  occur  for  sound 
waves.  For  cases  (a)  and  (c)  allowed  wavelengths  are  X  =  2 LI  n,  whereas  for  case  (b),  allowed  wavelengths  are  A  =  4 LI  (2n  —  1). 


We  already  touched  upon  the  speed  of  sound  waves  in  section  (equation  ).  This  speed  indicates 
how  fast  the  wavefronts  of  a  sound  wave  travel;  a  wavefront  is  defined  as  the  surface  (in  three  dimensions) 
where  all  points  have  the  same  phase  .  To  visualize  a  sound  wave  we  draw  a  succession  of  wavefronts  one 
wavelength  (or  one  period)  apart.  Simple  examples  include  a  point  source  (generating  spherical  wavefronts) 
and  a  planar  wave,  in  which  all  wave  fronts  are  parallel  planes  (see  figure  9.6).  The  (local)  direction  of  propa¬ 
gation  of  the  wave  is  the  direction  perpendicular  to  the  wavefronts,  sometimes  depicted  by  a  ray. 

Like  transversal  waves,  longitudinal  waves  exhibit  interference,  both  with  other  waves  they  encounter, 
and  by  their  own  reflections.  There  can  therefore  be  traveling  and  standing  sound  waves.  Unlike  transversal 
waves  on  strings  however,  longitudinal  sound  waves  are  typically  created  in  tubes  that  are  open  on  either 
one  or  both  ends.  A  closed  end  represents  a  fixed  point,  just  like  the  fixed  end  of  a  string  does,  resulting  in  a 
stringent  boundary  condition:  the  interference  between  the  incoming  and  outgoing  wave  must  be  such  that 
the  net  displacement  at  the  closed  end  is  zero.  An  open  end  corresponds  to  a  transversal  wave  in  a  string  that 
is  not  clamped.  As  the  string  in  that  case  is  free  to  move,  its  maximum  displacement  will  equal  the  amplitude 
of  the  wave.  In  other  words,  while  a  closed  end  corresponds  to  a  node,  the  open  end  corresponds  to  an 
antinode.  For  a  string  (or  pipe)  with  one  clamped  /  closed  and  one  open  end,  the  wavelength  of  the  lowest- 
order  mode  (known  as  the  fundamental  mode  or  first  harmonic)  therefore  equals  four  times  the  length  L  of 
the  string  /  pipe.  The  next  mode  (second  harmonic)  will  have  a  wavelength  of  4L/3,  and  so  on  (see  figure  b), 
resulting  in  A  =  4 LI (2 n  -  1)  for  the  nth  mode.  For  a  tube  with  two  open  ends,  the  fundamental  mode  is  the 
inverse  of  that  of  a  string  with  two  clamped  ends  -  so  two  antinodes  at  the  ends,  and  a  node  in  the  middle 
(figure  c).  Like  for  a  clamped  string,  the  allowed  wavelengths  are  therefore  A  =  2 L/n. 

9.7.  e  Doppler  effect 

The  Doppler  effect  is  a  physical  phenomenon  that  most  people  have  experienced  many  times:  when  a  moving 
source  of  sound  (say  an  ambulance,  or  more  exactly  its  siren)  is  approaching  you,  its  pitch  sounds  noticeably 
higher  then  after  it  passed  you  by  and  is  moving  away.  The  effect  is  due  to  the  fact  that  the  observed  wave¬ 
length  (and  therefore  frequency  /  pitch)  of  sound  corresponds  to  the  distance  between  two  points  of  equal 
phase  (i.e.,  two  sequential  wavefronts).  Ultimately,  the  Doppler  effect  thus  originates  in  a  change  in  refer¬ 
ence  frame  (the  same  frames  we  encountered  in  section  ) :  what  you  hear  is  indeed  different  from  what  the 

^wavefront  corresponding  to  the  maximum  extension  u  is  sometimes  called  a  wavecrest. 
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Figure  9.5:  Amplitude-modulated  wave.  The  amplitude  of  the  carrier  wave  (blue,  traveling  at  phase  velocity  vw  =  col  k)  is  changed  over 
time,  resulting  in  an  envelope  (red)  which  travels  at  the  lower  group  velocity  v g. 


Figure  9.6:  Snapshots  of  the  wavefronts  (points  with  equal  phase)  of  a  longitudinal  wave  for  (a)  a  point  source  (with  spherical  wavefronts) 
and  (b)  a  planar  wave  (where  all  wavefronts  are  parallel  planes).  Successive  wavefronts  are  separated  by  one  wavelength. 


ambulance’s  driver  hears.  The  latter  is  easy:  the  driver  isn’t  moving  with  respect  to  the  siren,  so  (s)he  simply 
hears  it  at  whatever  frequency  it  is  emitting.  For  the  stationary  observer  however,  the  ambulance  moves  be¬ 
tween  emitting  the  first  and  second  wave  crest,  and  so  their  distance  (and  hence  the  observed  wavelength  / 
frequency)  changes,  see  figure  1.8. 

Calculating  the  shift  in  wavelength  is  straightforward.  Let  us  call  the  speed  of  sound  v  and  the  speed 
of  the  source  u.  The  time  interval  between  two  wavefronts,  as  emitted  by  the  source,  is  At.  In  this  time 
interval,  the  first  wavefronts  travels  a  distance  As  =  vAt,  while  the  source  travels  a  distance  Ax  =  uAt.  For 
the  observer  to  which  the  source  is  approaching,  the  actual  distance  between  two  emitted  wavefronts  is  thus 
Ax'  =  As- Ax  =  (v- u)At.  The  actual  distance  between  the  wavefronts  is  the  observed  wavelength,  A0bs,  while 


Christian  Doppler  (1803-1853)  was  an  Austrian  physicist.  Doppler  was  a 
professor  of  physics  at  Prague  where  he  developed  the  notion  that  the  ob¬ 
served  frequency  of  a  wave  depends  on  the  relative  speed  of  the  source  and 
the  observer,  now  known  as  the  Doppler  effect.  Doppler  used  this  princi¬ 
ple  to  explain  the  observed  colors  of  binary  stars.  The  principle  was  devel¬ 
oped  independently  by  French  physicist  Armand  Fizeau  (1819-1896),  and 
is  therefore  sometimes  referred  to  as  the  Doppler-Fizeau  effect.  In  1847, 

Doppler  moved  to  Selmecbanya  in  Hungary,  but  was  forced  to  leave  again 
soon  afterwards  due  to  political  unrest  in  1848,  moving  to  the  university 
of  Vienna.  During  a  visit  to  Venice  in  1853,  Doppler  died  of  pulmonary 
disease,  aged  only  49. 

Figure  9.7:  Christian  Doppler  [24]. 
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Figure  9.8:  Doppler  effect.  The  source  emits  waves  with  a  fixed  time  interval  Af  between  successive  wavefronts,  (a)  For  observers  that 
are  stationary  with  respect  to  the  source,  the  distance  between  wavefronts  (in  red)  is  fixed,  so  they  measure  the  same  wavelength  as  the 
one  emitted  by  the  source,  (b)  If  the  source  is  moving  with  respect  to  the  observers  (here  to  the  right),  the  observers  measure  a  different 
distance  between  arriving  wavefronts  -  compressed  (so  shorter  wavelength  /  higher  frequency)  if  the  source  is  approaching  (green  dot), 
expanded  if  the  source  is  receding  (purple  dot). 


Ernst  Mach  (1838-1916)  was  an  Austrian  physicist. 
Mach  was  a  professor  of  mathematics  and  later 
physics  at  Graz,  Prague  and  Vienna.  His  experimen¬ 
tal  work  focused  on  the  properties  of  waves,  especially 
in  light,  as  well  as  on  the  Doppler  effect  in  both  light 
and  sound.  In  1888,  Mach  used  photography  to  cap¬ 
ture  the  shockwaves  created  by  a  supersonic  bullet.  In 
addition  to  physics,  Mach  was  highly  interested  in  phi¬ 
losophy,  holding  the  position  that  only  sensations  are 
real.  Consequently,  Mach  refused  to  accept  that  atoms 
are  real,  as  they  could  not  be  observed  directly  at  the 
time;  it  was  Einstein’s  1905  work  on  Brownian  motion 
that  eventually  proved  him  wrong.  The  ratio  of  an  ob¬ 
ject’s  speed  to  the  speed  of  sound  is  now  known  as  the 
Mach  number  in  his  honor. 


Figure  9.9:  Left:  Mach’s  picture  of  the  shockwave  of  a  super¬ 
sonic  bullet  [25].  Right:  Ernst  Mach  (1902)  [26]. 


the  emitted  wavelength  is  A  =  As  -  vA  t,  so  the  two  are  related  through 

„  v-  u  „ 

A0bs  = - A.  (9.19) 

v 

For  a  source  that  is  moving  away,  we  simply  flip  the  sign  of  u;  naturally  for  a  stationary  source  we  have  A0bs  = 
A.  Note  that  we  could  also  consider  a  stationary  source  and  a  moving  observer:  the  effect  would  be  exactly 
the  same,  where  in  equation  (9.19)  we  define  motion  towards  the  source  to  be  the  positive  direction. 

The  Doppler  effect  is  usually  expressed  in  terms  of  frequency  instead  of  wavelength,  but  that  is  a  trivial 
step  from  equation  (  ),  as  f0 bS  =  WA0bS  and  f  —  vl  A,  which  gives: 


fobs  — 


V-  U 


-/■ 


(9.20) 


Although  we  discussed  the  Doppler  effect  here  in  the  context  of  sound  waves,  it  occurs  for  any  kind  of 
waves  -  most  notably  also  light.  We  will  encounter  it  again  when  we  discuss  waves  in  special  relativity  (where 
speeds  become  comparable  to  that  of  light)  in  section 

Note  that  equation  (9.19)  predicts  that  the  wavelength  is  zero  if  the  speed  of  the  source  equals  the  speed 
of  the  emitted  waves.  This  case  is  illustrated  in  figure  9.10a,  which  shows  that  the  wavefronts  pile  up.  For  a 
source  moving  faster  than  the  wave  speed  (figure  9.10b),  the  waves  follow  the  source,  creating  a  conical  shock 
wave,  with  opening  angle  given  by 

vT  v 
sind  =  — —  =  — . 
uT  u 

Both  the  bow  wave  of  a  boat  and  the  sonic  boom  of  a  supersonic  jet  are  examples  of  shockwaves. 


(9.21) 
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Figure  9.10:  Shock  waves.  Like  in  figure  9.8,  the  source  emits  wavefronts  with  a  fixed  time  interval  At.  (a)  If  the  source  is  moving  at  the 
same  speed  as  the  emitted  wave,  the  wavefronts  all  collide,  creating  a  shock  wave,  (b)  For  a  source  moving  faster  than  the  speed  of  the 
waves,  the  waves  all  travel  behind  the  source,  creating  a  conical  (shock)  wave  front  (as  you  may  have  heard  after  seeing  a  jet  fighter  pass 
overhead).  The  blue  dot  indicates  the  current  position  of  the  source,  the  green  dot  that  one  period  ago,  the  red  dot  two  periods  ago. 


9.8.  Problems 

9.1  Sound  waves  in  a  spring.  In  section  l,  we  found  that  the  speed  of  a  wave  in  a  string  is  given  by  v  - 

yjTI\i,  with  T  the  tension  in  the  string  and  p  its  mass  density  (equation  ). 

(a)  A  spring  of  mass  m  and  spring  constant  Ic  has  an  unstretched  length  Lq.  Find  an  expression  for  the 
speed  of  transverse  waves  on  this  spring  when  it’s  been  stretched  to  a  length  L. 

(b)  You  measure  the  speed  of  transverse  waves  in  an  ideal  spring  under  stretch.  You  find  that  at  a  certain 
length  L\  it  has  a  value  v,  and  at  length  2L\  the  wavespeed  has  value  3v.  Find  an  expression  for  the 
unstretched  length  of  the  spring  in  terms  of  Li . 

(c)  A  uniform  cable  hangs  vertically  under  its  own  weight.  Show  that  the  speed  of  waves  on  the  cable 
is  given  by  v  —  Jzg ,  where  z  is  the  distance  from  the  bottom  of  the  cable.  You  may  assume  that  the 
stretching  of  the  cable  is  small  enough  that  its  mass  density  can  be  taken  to  be  uniform. 

(d)  Show  that  the  time  it  takes  a  wave  to  propagate  up  the  cable  in  (  )  is  t  —  2yjL! g,  with  L  the  cable 
length. 


9.2  In  deep  water,  the  speed  of  surface  waves  depends  on  their  wavelength: 


v  = 


(9.22) 


(a)  Apart  from  satellite  images,  offshore  storms  can  also  be  detected  by  watching  the  waves  at  the  beach. 
Equation  (9.2  )  tells  us  that  the  longest-wavelength  waves  will  travel  the  fastest,  so  the  arrival  of  such 
waves,  if  their  amplitude  is  high,  is  a  foreboding  of  the  possible  arrival  of  a  storm  (the  friction  between 
the  wind  and  the  water  being  the  source  of  the  waves).  A  typical  storm  may  be  thus  detected  from 
a  distance  of  500  km,  and  travel  at  50  km/h.  Suppose  the  detected  waves  have  crests  200  m  apart. 
Estimate  the  time  interval  between  the  detection  of  these  waves  and  the  arrival  of  the  storm  (in  the 
case  the  storm  moves  straight  towards  the  beach). 


In  shallow  water,  the  speed  of  surface  waves  becomes  (to  first  order)  independent  of  the  wavelength,  but 
scales  with  the  depth  of  the  water  instead 

v  =  ^fg, d .  (9.23) 

(b)  Next  to  storms,  a  possible  source  of  surface  waves  in  the  ocean  are  underwater  earthquakes.  While 
storms  are  typically  more  dangerous  at  sea,  the  waves  generated  by  earthquakes  are  more  dangerous 
on  land,  as  they  may  result  in  tsunami’s:  huge  wavecrests  that  carry  a  lot  of  energy.  At  open  sea,  the 
amplitude  of  the  waves  that  will  create  the  tsunami  may  be  modest,  on  the  order  of  1  m.  What  will 
happen  with  this  wave’s  speed,  amplitude,  and  wavelength  when  it  approaches  the  land? 


9.3  Because  the  wave  equation  is  linear,  any  linear  combination  of  solutions  is  again  a  solution;  this  is  known 
as  the  principle  of  superposition,  see  section  9.4.  We  will  consider  several  examples  of  superposition  in 
this  problem.  First,  consider  the  two  one-dimensional  sinusoidal  traveling  waves  u±(x,t )  =  A  $\n(kx  + 
Dt). 


9.8.  Problems 
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(a)  Which  wave  is  traveling  in  which  direction? 

(b)  Find  an  expression  for  the  combined  wave,  u{x,  t)  =  u+{x,  t)  +  U-{x,  t).  You  may  use  that  sin(a)  + 
sin(/3)  =  2sin((a:  +  (5)12)  cos((a  -  /S) /2). 

(c)  The  combined  wave  is  a  standing  wave  -  how  can  you  tell? 

(d)  Find  the  positions  at  which  u[x,  t)  -  0  for  all  f.  These  are  known  as  the  nodes  of  the  standing  wave. 

(e)  Find  the  positions  at  which  u(x,  t)  reaches  its  maximum  value.  These  are  known  as  the  antinodes  of 
the  standing  wave. 

Next,  consider  two  sinusoidal  waves  which  have  the  same  angular  frequency  co,  wave  number  k,  and 
amplitude  A,  but  they  differ  in  phase: 

iii(x,  t)  -  Acos(kx-cot)  and  U2(x,  t)  —  Acos(kx- a>t  +  (p). 

(f)  Show  that  the  superposition  of  these  two  waves  is  also  a  simple  harmonic  (i.e.,  sinusoidal)  wave,  and 
determine  its  amplitude  as  a  function  of  the  phase  difference  (p. 

Finally  consider  two  sources  of  sound  that  have  slightly  different  frequencies.  If  you  listen  to  these,  you’ll 
notice  that  the  sound  increases  and  decreases  in  intensity  periodically:  it  exhibits  a  beating  pattern,  due 
to  interference  of  the  two  waves  in  time.  In  case  the  two  sources  can  be  described  as  emitting  sound 
according  to  simple  harmonics  with  identical  amplitudes,  their  waves  at  your  position  can  be  described 
by  u\{t)  =  Acos(wif)  and  U2(t)  =  Acos(a>2t). 

(g)  Find  an  expression  for  the  resulting  wave  you’re  hearing. 

(h)  What  is  the  frequency  of  the  beats  you’re  hearing?  NB:  because  the  human  ear  is  not  sensitive  to 
the  phase,  only  to  the  amplitude  or  intensity  of  the  sound,  you  only  hear  the  absolute  value  of  the 
envelope.  What  effect  does  this  have  on  the  observed  frequency? 

(i)  You  put  some  water  in  a  glass  soda  bottle,  and  put  it  next  to  a  440  Hz  tuning  fork.  When  you  strike 
both,  you  hear  a  beat  frequency  of  4  Hz.  After  adding  a  little  water  to  the  soda  bottle,  the  beat  fre¬ 
quency  has  increased  to  5  Hz.  What  are  the  initial  and  final  frequencies  of  the  bottle? 

9.4  One  of  your  friends  stands  in  the  middle  of  a  rectangular  10.0  x  6.0  m  swimming  pool,  his  hands  1.0  meter 
apart  in  the  direction  parallel  to  the  long  edge  of  the  pool.  He  produces  surface  waves  in  the  water  of  the 
pool  by  oscillating  his  hands.  At  the  edge,  you  find  that  at  the  point  closest  to  your  friend,  the  water  is 
rough,  then  if  you  move  to  the  side,  it  gets  quiet,  rough  again,  and  quiet  again.  That  point,  where  the 
water  gets  quiet  for  the  second  time,  lies  1.0  m  from  your  starting  point  (facing  your  friend). 

(a)  What  is  the  wavelength  of  the  surface  waves  in  the  pool? 

(b)  At  which  distance  does  the  water  get  quiet  for  the  first  time? 

(c)  And  at  which  distance  do  you  find  rough  water  for  the  third  time  (counting  the  initial  point)? 

9.5  The  Doppler  effect  is  the  shift  in  observed  frequency  of  a  wave  due  to  either  a  moving  observer  or  moving 
source,  as  discussed  in  section  .  We  will  consider  a  sound  wave  emitted  by  some  noisy  source  and 
observed  by  you. 

(a)  If  you  are  standing  still  and  the  source  is  moving  towards  you,  will  the  frequency  you  hear  be  higher 
or  lower  than  the  frequency  emitted  by  the  source? 

(b)  If  you  move  towards  a  stationary  source,  will  the  frequency  you  hear  be  higher  or  lower  than  the 
frequency  emitted  by  the  source? 

(c)  The  observed  frequency  f0 bS  depends  on  the  actual  frequency  emitted  by  the  source  /source  (obvi¬ 
ously),  the  speed  of  the  source  ^source,  the  speed  of  the  observer  v0bS  and  the  speed  of  sound  ;;S01IIU|. 
Take  the  observer  to  be  stationary.  What  happens  if  the  source  is  stationary  also?  And  what  if  the 
source  moves  at  the  speed  of  sound? 

(d)  Based  on  your  answers  to  the  previous  items,  guess  a  functional  form  for  f0 bs  as  a  function  of  /source. 

^source  <Uld  ^sourul • 
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(e)  Now  lets  do  the  math  properly.  Assume  0  <  ^source  <  fsound,  and  t'obs  =  0.  Also  assume  the  source 
moves  in  a  straight  line  towards  the  observer.  Plot  the  position  of  the  source  for  a  few  points  in  time, 
and  the  fronts  of  the  emitted  waves  at  the  same  point  in  time  (so  not  the  time  of  emission  -  then  they 
are  all  at  the  source,  but  the  time  of  last  emission,  or  slightly  after  that). 

(f)  Calculate  the  period  ^obs  measured  by  the  observer  by  considering  the  arrival  times  of  two  maxima 
emitted  a  time  interval  rsource  apart.  Hint:  first  find  the  distance  A L  the  observer  would  measure 
between  these  maxima,  i.e.,  the  observed  wavelength. 

(g)  Convert  your  answer  to  the  previous  point  to  an  expression  for  f0 bs. 

(h)  Was  your  guess  at  (5d)  right?  Which  aspects  of  the  correct  answer  could  you  not  have  guessed  based 
on  your  observations  at  (5c)? 

(i)  How  does  the  expression  for  f0 bS  change  if  the  observer  is  moving?  And  what  if  both  source  and 
observer  move? 

9.6  A  supersonic  (faster  than  sound)  object  (e.g.  a  fighter  plane)  is  moving  towards  you  at  speed  /'source, 
creating  a  shock  wave  in  the  shape  of  a  cone.  Show  that  the  opening  angle  of  this  cone  is  given  by  sind  = 
t'sound/  ^source  (equation  9.21).  Hint:  draw  the  positions  of  the  sound  wave  as  emitted  one,  two  and  three 
periods  ago  (as  in  figure  9 . 1  Ob) ,  then  use  geometry  to  find  the  angle  their  tangent  line  makes  with  the  line 
of  flight. 
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II 

Special  relativity 
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Einstein’s  postulates 


1 0. 1 .  AN  OLD  AND  A  NEW  AXIOM 

The  theory  of  special  relativity  is  built  on  two  postulates  (our  axioms  for  this  chapter).  The  first  one  also 
applies  to  classical  mechanics,  and  simply  states  that: 

Axiom  1  (Principle  of  relativity).  The  laws  of  physics  are  identical  in  every  inertial  reference  frame. 

You  probably  haven’t  heard  of  ‘inertial  reference  frames’  before.  Quite  likely,  you’ve  not  given  this  principle 
much  thought  either,  but  nonetheless,  you  are  (almost  certainly)  intimately  familiar  with  it  on  an  intuitive 
level.  Consider  an  example  we’ll  use  a  lot  in  this  chapter.  Suppose  you’re  in  a  train  car  with  no  windows. 
Is  there  any  experiment  you  can  devise  within  the  confines  of  the  car  that  will  tell  you  whether  the  train  is 
standing  still  or  moving  at  constant  velocity  (both  direction  and  magnitude)?  The  answer  is  no,  of  course  - 
that’s  a  direct  consequence  of  Newton’s  first  law.  A  pendulum  will  hang  straight  down  in  a  stationary  train  and 
in  one  moving  at  constant  velocity,  and  a  ball  you  roll  will  trace  out  a  straight  line  in  both  cases.  Things  change 
of  course  when  the  train  accelerates  (that’s  where  Newton’s  second  law  comes  in),  but  as  long  as  you  keep  your 
speed  (zero  or  not)  and  direction  fixed,  you  might  as  well  be  stationary  as  far  as  physics  is  concerned. 

Now  what’s  an  inertial  reference  frame?  A  reference  frame  is  simply  the  set  of  measures  you  use  to  describe 
the  world:  your  coordinate  system.  For  the  person  on  the  platform,  the  system  will  be  fixed  to  the  platform, 
with  the  origin  (for  example)  at  the  point  they’re  standing.  For  someone  on  the  train,  it’d  be  convenient  to 
have  the  origin  at  the  corner  of  the  car,  and  of  course  the  frame  co-moving  with  the  car.  An  inertial  reference 
frame  could  be  defined  either  as  any  reference  frame  that  moves  at  constant  velocity  with  respect  to  another 
inertial  reference  frame,  or  (as  is  most  often  done),  by  inverting  the  principle  of  relativity,  stating  that  an  in¬ 
ertial  reference  frame  is  one  in  which  the  laws  of  physics  (i.e.,  Newton’s  laws  in  classical  mechanics,  or  more 
specifically  Newton’s  first  law)  hold  without  modifications.  To  illustrate  that  this  is  not  a  trivial  point,  con¬ 
sider  a  rotating  reference  frame:  there  the  laws  of  physics  actually  change  (you  experience  additional  forces 
like  the  centrifugal  and  Coriolis  force),  and  you  could  figure  out  you’re  rotating  from  a  simple  experiment  (a 
pendulum  at  rest  will  no  longer  point  down,  but  slightly  outward). 

Returning  to  inertial  reference  frames,  there  is  one  more  point  to  be  made,  which  you  also  already  know. 
Both  position  and  velocity  are  relative  concepts,  in  the  sense  that  they  depend  on  the  observer.  In  the  train 
example  this  is  obvious.  From  the  point  of  view  of  a  person  sitting  on  the  train,  other  objects  on  the  train 
are  stationary  in  their  comoving  reference  frame,  so  at  a  fixed  position  and  zero  velocity;  the  observer  on  the 
platform  however  will  tell  you  that  the  same  object  has  a  changing  position  and  a  velocity  equal  to  that  of 
the  moving  train.  In  classical  mechanics,  what  is  not  relative  is  acceleration.  As  Newton’s  second  law  holds 
in  both  inertial  reference  frames,  the  same  force  gives  the  same  acceleration  according  to  both  observers. 
What  Einstein  discovered  is  that  although  this  observation  still  holds  at  relatively  low  velocities,  it  is  not  true 
at  higher  speeds.  Instead,  both  observers  will  agree  on  the  value  of  a  different  observable  quantity:  the  speed 
of  light  c. 

Axiom  2  (Light  postulate).  The  speed  of  light  in  vacuum  is  the  same  in  all  inertial  reference  frames. 

The  light  postulate  has  an  important  consequence:  it  sets  the  speed  of  light  as  the  ultimate  speed  limit  in  the 
universe.  Worse,  you  (or  any  other  object  with  mass),  cannot  even  travel  at  the  speed  of  light.  We’ll  show  this 
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10.  Einstein’s  postulates 


Albert  Einstein  (1879-1955)  was  a  German  physicist,  and  quite  likely  the 
most  widely  known  scientist  in  the  world  today.  Einstein  studied  physics 
in  Zurich,  but  could  not  find  a  research  position  after  he  graduated,  so  he 
combined  his  research  work  with  a  position  at  the  Swiss  patent  office  in 
Bern.  In  1905,  his  ‘miracle  year’,  Einstein  published  four  papers,  all  with 
enormous  impact  on  physics:  one  explaining  Brownian  motion  of  small 
particles  in  water,  one  on  the  (quantum-mechanical)  photo-electric  ef¬ 
fect,  one  on  special  relativity,  and  one  on  the  energy-matter  relation  (the 
famous  E  =  me2).  This  work  led  to  Einstein  becoming  a  professor  in  1909, 
and  receiving  the  Nobel  prize  (for  the  photo-electric  effect)  in  1921.  Ein¬ 
stein  extended  the  theory  of  relativity  to  include  gravity,  resulting  in  the 
prediction  of  the  bending  of  light  by  gravity  (1911,  confirmed  1919)  and  the 
existence  of  gravitational  waves  (1915,  confirmed  2015).  Einstein  became 
a  public  figure  in  the  1920s,  visiting  many  places  around  the  world.  When 
the  Nazi’s  seized  power  in  Germany  in  1933,  Einstein,  who  was  Jewish  and 
living  in  Berlin,  became  one  of  the  first  targets.  He  left  Germany  and  gave 
up  his  citizenship,  eventually  moving  to  Princeton  in  the  US,  and  advo¬ 
cating  for  the  active  extraction  of  fellow  Jewish  German  scientists.  Having 
been  a  pacifist  all  his  life,  Einstein  vehemently  opposed  war,  but  also  re¬ 
alized  that  Nazi  Germany  would  not  hesitate  to  build  and  use  an  atomic 
bomb,  so  he  argued  that  the  US  should  develop  one  also  (though  he  was 
horrified  when  it  was  used  against  Japan).  In  his  years  in  Princeton,  Ein¬ 
stein  tried  to  find  a  theory  unifying  gravity  and  quantum  mechanics,  but 
failed  to  do  so  (we  still  haven’t  succeeded);  he  did  not  like  the  random  na¬ 
ture  inherent  in  quantum  mechanics  and  tried  to  prove  it  was  incomplete 
(formulating  the  Einstein-Podolsky-Rosen  paradox),  which,  though  later 
proven  incorrect,  led  to  the  study  of  quantum  entanglement  that  is  the 
foundation  of  a  future  quantum  internet. 


Figure  10.1:  Official  1921  Nobel 
prize  portrait  of  Albert  Einstein 
[27]. 


mathematically  further  on,  but  a  simple  thought  experiment  suffices  to  show  that  this  point  is  true.  Assume 
the  opposite:  suppose  you  can  (and  do)  travel  at  light  speed  with  respect  to  a  stationary  observer.  Next, 
suppose  you  emit  a  light  pulse,  for  instance  by  switching  a  flashlight  on  and  off.  From  your  point  of  view,  the 
pulse  travels  at  light  speed,  so  it  speeds  ahead  of  you  quickly.  However,  from  the  stationary  observer’s  point 
of  view,  the  pulse  also  travels  at  light  speed  -  which  is  the  same  speed  you  travel  at,  so  the  photon  would  never 
leave  you.  As  the  photon  needs  to  either  leave  you  or  stay  with  you  (but  cannot  do  both) ,  we  arrive  at  a  logical 
contradiction,  and  conclude  that  you  cannot  travel  at  the  speed  of  light. 

10.2.  Consequences  of  Einstein’s  postulates 

The  combination  of  the  two  postulates  in  section  10.1  leads  to  a  number  of  consequences  that  appear  to 
be  at  odds  with  everyday  experience.  They  imply  that  there  are  no  such  things  as  universal  measures  of 
time  and  length,  nor  even  agreement  on  whether  events  are  simultaneous  or  not.  The  reason  why  we  don’t 
observe  these  consequences  all  the  time  is  that  their  effects  are  very  small  for  objects  which  are  moving  slowly 
(as  compared  to  the  speed  of  light).  Nonetheless,  they  do  exist,  and  can  be  measured  -  and  matter  a  lot  in 
situations  where  speeds  are  high,  such  as  in  particle  accelerators  and  cosmic  radiation. 

10.2.1.  )SS  OF  SIMULTANEITY 

Consider  the  following  (thought)  experiment.  Somebody  stands  in  the  middle  of  a  train  car  with  mirrors  at 
either  end.  The  car  moves  with  constant  speed  v  with  respect  to  the  platform,  at  which  we  place  a  second 
observer.  The  person  on  the  train  holds  two  laser  pointers,  which  she  presses  at  the  same  time  (or  perhaps  a 
device  with  a  single  button  that  sends  out  two  beams,  to  avoid  experimenter  bias) .  According  to  the  person  on 
the  train,  the  beams  reach  both  mirrors  simultaneously,  as  they  travel  at  the  same  speed,  and  cover  the  same 
distance.  Now  according  to  the  person  on  the  platform,  the  beams  also  travel  at  the  same  speed(!)  -  Einstein’s 
light  postulate  tells  us  that  all  observers  measure  the  speed  of  light  to  be  the  same.  However,  according  to 
this  stationary  observer,  the  train  also  moves,  and  thus  the  light  beam  traveling  to  the  front  of  the  train  has  to 
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cover  a  greater  distance  than  the  one  going  to  the  back  of  the  train.  Consequently,  the  backwards-traveling 
beam  arrives  at  its  mirror  before  the  forwards-traveling  one  does.  We  are  forced  to  conclude  that  events  that 
are  simultaneous  in  one  inertial  reference  frame  are  not  necessarily  simultaneous  in  another. 

Fortunately,  there  is  an  event  that  both  observers  agree  on:  the  fact  that  the  two  light  beams,  once  re¬ 
flected,  return  to  the  person  in  the  middle  of  the  train  at  the  same  time.  From  the  point  of  view  of  the  person 
on  the  train  this  is  obvious.  For  the  person  on  the  platform,  a  simple  calculation  shows  that  the  distance  that 
the  backwards-traveling  beam  gains  on  the  outbound  trip  equals  the  distance  it  looses  on  the  return  trip,  and 
vice  versa.  Different  observers  don’t  in  general  agree  on  the  simultaneity  (or  even  order)  of  events  happening 
at  different  points  in  space,  but  they  do  agree  on  the  order  of  events  at  a  given  point  in  space  -  which  means 
that  relativity  preserves  causality  (the  concept  that  causes  precede  effects). 


a)  at  rest  /comoving  b)  moving 

v 


z 


□ 


Figure  10.2:  The  light  clock  according  to  (a)  the  comoving  observer  and  (b)  a  stationary  observer  (if  the  clock  moves  at  speed  v  in  the 
direction  parallel  to  the  plane  of  the  mirrors) . 


10.2.2.  :E  DILATION 

Since  everybody  agrees  on  the  speed  of  light  (and  very  little  else),  it  makes  sense  to  use  the  speed  of  light 
to  determine  other  physical  quantities,  such  as  the  time  interval  between  two  events.  To  that  end,  we  can 
construct  a  light  clock:  we  place  two  mirrors  in  parallel,  and  let  a  beam  of  light  bounce  back  and  forth  between 
them  (figure  10.2).  Since  we  agree  on  at  least  the  order  of  events  at  the  same  point  in  space,  let’s  take  as 
our  time  interval  the  time  it  takes  for  the  light  to  travel  back  and  forth  between  the  mirrors  once  (i.e.,  one 
round  trip).  According  to  the  comoving  observer  (whose  coordinates  we’ll  denote  with  primes,  and  whose 
system  of  coordinates  we’ll  call  S'),  this  time  interval  is  given  by  At'  =  2 Lie,  as  the  distance  traveled  by  the 
light  is  simply  twice  the  perpendicular  distance  between  the  mirrors.  However,  for  the  stationary  observer  it 
seems  that  the  light  has  moved  in  the  direction  parallel  to  the  mirrors  as  well  (see  figure  ,2b).  We’ll  denote 
his  coordinate  system  as  S,  with  time  interval  At.  The  total  distance  traveled  by  the  light  is  (invoking  the 
Pythagorean  theorem)  \J (2 L)2  +  {vAt)2,  which  should  equal  cAt.  Solving  for  At,  we  find  (At)2  =  (2 i)2/(c2  - 
v2),  which  is  longer  than  the  time  interval  measured  by  the  comoving  observer  (that  makes  sense  -  if  the  light 
travels  at  the  same  speed,  a  larger  distance  should  take  longer).  The  two  time  intervals  are  related  by: 

1 

At  —  - At  -  yliAAt  .  (10.1) 

Vl-v2/c2 

The  factor  y(v)  will  return  frequently  in  this  chapter.  Its  value  is  one  if  v  -  0,  and  becomes  progressively  larger 
as  v  increases,  to  blow  up  at  v  =  c.  For  ‘small’  (compared  to  c)  values  of  v  the  value  of  y(  u)  is  very  close  to 
one,  which  is  why  the  effects  of  special  relativity  hardly  ever  show  up  in  everyday  life. 

We  have  established  that  time  intervals  between  two  events  are  different  for  two  different  (comoving  and 
stationary)  observers.  There  is  a  subtle  but  important  point  to  make  about  how  time  is  perceived  to  progress 
according  to  a  single  observer.  Let’s  start  out  with  the  stationary  observer.  Suppose  this  observer  both  mea¬ 
sures  time  with  a  light  clock  in  his  own  frame  of  reference,  and  observes  an  identical  light  clock  on  a  moving 
train.  Since  the  time  interval  for  the  roundtrip  of  the  light  in  the  comoving  (or  in  this  case,  co- stationary)  light 
clock  is  measured  to  be  less  than  that  of  the  moving  clock,  the  stationary  observer  concludes  that  the  moving 
clock  is  running  slow.  This  observation  is  universal,  and  known  as  time  dilation.  To  stress  how  universal  time 
dilation  is,  consider  the  point  of  view  of  the  person  on  the  train:  according  to  her,  her  clock  is  running  ‘nor¬ 
mal’,  while  the  clock  on  the  platform  runs  slow  -  which  is  in  perfect  agreement  with  the  above  statement,  as 
from  the  point  of  view  of  the  train,  it’s  the  platform  that’s  moving. 
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The  Michelson-Morley  experiment  (1887)  was  an  attempt  to  mea¬ 
sure  the  relative  motion  of  the  Earth  with  respect  to  the  aether, 
a  substance  that  was  postulated  to  fill  all  of  space.  It  was  well 
known  in  the  19th  century  that  the  Earth’s  atmosphere  only  ex¬ 
tends  to  about  100  km  up,  and  since  sunlight  can  reach  the  Earth, 
it  was  postulated  that  there  must  be  another  substance  that  acted 
as  the  medium  for  the  propagation  of  light  (much  like  sound  prop¬ 
agates  through  air,  water,  or  even  solids,  but  not  through  vacuum) . 

Since  the  Earth  moves  around  the  sun,  it  should  move  relatively  to 
the  aether,  or  from  the  point  of  view  of  an  observer  on  Earth,  the 
aether  should  flow  through  space  (‘aether  wind’).  Consequently, 
the  aether  should  affect  the  speed  of  light  for  a  beam  traveling  in 
the  same  direction  as  the  wind,  but  not  one  traveling  perpendicu¬ 
lar  to  this  direction.  Michelson  and  Morley  attempted  to  use  this 
principle  to  measure  the  speed  of  the  aether  wind,  with  a  device 
now  known  as  a  Michelson  interferometer  (pictured).  However, 
they  found  no  difference  at  all,  for  any  angle  of  the  interferometer 
arms.  Lorentz  (see  next  box)  initially  attempted  to  explain  this  re¬ 
sult  by  introducing  the  concept  of  ‘local  time’,  which  would  lead  to 
a  Lorentz  contraction  of  one  of  the  arms  with  respect  to  the  other, 
canceling  the  effect  of  the  aether  wind.  Einstein  took  a  more  rad¬ 
ical  approach,  dropping  the  concept  of  the  aether  altogether,  and 
replacing  it  with  his  two  postulates,  which  have  Lorentz  contrac¬ 
tion  as  one  of  their  consequences.  Three  (very  sensitive)  Michelson 
interferometers  have  recently  been  used  to  detect  small  vibrations 
in  spacetime  itself,  the  gravitational  waves  predicted  by  Einstein’s 
general  theory  of  relativity 
If  the  apparatus  moves  to  the  right  with  speed  v,  the  speed  of  light  on  path  1  (up-down)  is  given  by 
Mi  =  |  Mi  |  =  V  c2  -  v2.  If  the  distance  between  the  beamsplitter  and  the  mirror  is  L,  the  time  it  takes  to 
traverse  path  1  (back  and  forth)  is  then  given  by 

_  2 L  _  2 L  _  2 Lie 

ul  \J C2  -  V2  y/l  -  (vie)2 

For  path  2,  the  speed  of  the  light  on  the  way  out  to  the  mirror  equals  m0Ui  =  c  -  v,  while  the  speed  on  the 
return  path  equals  U[n  =  c  +  v.  The  total  time  for  the  trajectory  (also  of  length  L)  thus  equals 

L  L  2 Lie  1 

t'l  — - 1 - —  - 77  —  -  t\  , 

c-v  C+V  1  -{vlcY  yj\-{vlc)2 

so  path  2  always  takes  less  time  than  path  1.  (Note  that  the  proportionality  factor  is  exactly  the  same  as 
the  one  we'll  find  relating  the  time  observed  by  a  comoving  and  a  stationary  observer  in  the  relativistic 
picture  -  Lorentz  wasn't  that  far  off!).  If  however  the  speed  of  light  is  identical  in  paths  1  and  2,  the  time  it 
takes  to  traverse  either  of  them  is  identical  too,  and  we  expect  no  interference. 


mirror 


- > 
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Figure  10.3:  A  Michelson  interferome¬ 
ter  splits  a  light  beam  in  two,  then  mea¬ 
sures  the  interference  of  the  two  beams 
after  they  have  returned.  If  the  whole 
system  moves  with  speed  v  in  the  di¬ 
rection  indicated  by  the  blue  arrow,  the 
time  it  takes  the  light  to  traverse  path  1 
is  longer  than  it  takes  to  traverse  path  2, 
resulting  in  interference  fringes. 
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A  rather  famous  example  of  the  effect  of  time  dilation  is  the  observation  of  the  number  of  high-velocity 
muons  (particles  similar  to  electrons,  but  much  heavier  and  unstable)  at  the  surface  of  the  earth.  The  muons 
are  created  in  the  upper  atmosphere  (about  20  km  up),  when  cosmic  rays  collide  with  atmospheric  atoms. 
The  muons  have  a  decay  half-time  of  2.2  ps  (meaning  that  after  this  time  half  of  the  original  muons  have 
decayed).  The  muons  are  created  at  very  high  speed,  close  to  that  of  light  ( v  =  0.999c).  Even  so,  classically 
they  can  only  travel  about  650  m  before  half  of  them  are  gone,  and  almost  none  will  reach  the  surface  of  the 
earth.  However,  since  according  to  us  stationary  observers  on  earth  the  muons’  clocks  run  slow,  we  expect 
the  half-time  to  be  effectively  extended  with  a  factor  y(0.999c)  =  22,  resulting  in  a  distance  of  about  15  km 
before  half  of  them  have  decayed,  and  a  significant  number  reaching  the  surface  -  as  is  observed. 

a)  at  rest /comoving  b)  moving 
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Figure  10.4:  The  horizontal  light  clock  according  to  (a)  the  comoving  observer  and  (b)  a  stationary  observer  (if  the  clock  moves  at  speed  v 
in  the  direction  perpendicular  to  the  plane  of  the  mirrors). 


10.2.3.  .ORENTZ  CONTRACTION 

Not  only  do  observers  in  different  inertial  frames  not  agree  on  the  duration  on  time  intervals,  they  don’t  agree 
on  the  length  of  objects  (i.e.,  the  size  of  space  intervals)  either.  For  distances  perpendicular  to  a  direction  of 
motion  there’s  no  issue  -  for  the  light  clock  in  figure  .2,  both  observers  measure  a  distance  L  between  the 
two  mirrors.  However,  if  they  were  to  be  asked  about  say  the  length  of  the  wagon  of  the  train  we’re  imagining 
the  moving  observer  to  be  in,  their  answers  wouldn’t  agree. 

To  see  how  this  difference  in  measured  length  comes  about  and  how  lengths  are  related,  we  return  to  the 
light  clock,  but  now  turn  it  on  its  side  (figure  )  .  For  both  observers,  measuring  the  distance  between  the 
mirrors  can  be  done  by  measuring  the  time  it  takes  a  lightbeam  to  make  a  roundtrip  between  them.  For  the 
comoving  observer,  we  find  that  1/  =  he  At1.  For  the  stationary  observer,  the  picture  is  more  complicated,  as 
the  mirrors  move  while  the  light  travels.  On  the  way  out,  the  distance  the  light  has  to  travel  is  the  distance  L 
between  the  mirrors,  plus  the  distance  i/Afright  the  far  mirror  moves.  This  total  distance  should  equal  cAfright> 
so  we  get  A  fright  =  Ll{c-  v).  On  the  way  back,  we  get  a  traveled  distance  of  L  minus  pAfieft,  which  should  equal 
cAfieft,  so  we  get  A  =  L/(c+  v).  The  total  time  traveled  is  the  sum  of  these  two,  which  is  given  by: 


L  L  2  Lc 

At  —  A  fright  +  A  fieft  —  I  —  ..  ~ 

C—V  C  +  V  cz  -  vz 


(10.2) 


or 


j{v)At '  = 


(10.3) 


1  Alternatively,  we  may  say  that  we  measure  the  length  of  the  train  wagon  by  sending  a  light  beam  back  and  forth  in  the  wagon,  bouncing 
off  a  mirror  at  the  end. 
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The  two  lengths  are  thus  related  by  the  inverse  of  the  y  factor  that  relates  two  time  intervals.  Using  an  argu¬ 
ment  similar  to  that  of  the  clocks  (relating  the  length  of  two  identical  sticks,  one  stationary  and  one  moving), 
we  can  conclude  that  moving  lengths  contract,  an  effect  known  as  Lorentz  contraction. 

There  is  an  interesting  symmetry  between  time  dilation  and  Lorentz  contraction,  which  gives  an  alter¬ 
native  way  of  getting  equation  (  )  once  the  effect  of  time  dilation  is  known.  Consider  again  the  example 

of  the  muons,  but  now  go  to  the  frame  co-moving  with  the  muons.  In  this  frame,  the  decay  half-time  is  still 
2.2  ps,  but  the  same  number  of  muons  reach  the  surface  of  the  earth  as  in  the  stationary  frame.  The  reason  is 
that  according  to  the  muons,  the  distance  from  the  upper  atmosphere  to  the  surface  is  contracted,  by  exactly 
a  factor  l/y(i/),  which  gives  the  same  distance  of  15  km  at  which  half  of  them  have  decayed. 


10.3.  Problems 
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Figure  10.5:  A  person  standing  at  the  back  of  a  moving  train  car  fires  a  photon  and  a  bullet  at  the  same  time.  The  photon  reflects  off  a 
mirror  at  the  front  of  the  train,  and  meets  up  with  the  bullet  sometime  later. 


10.3.  Problems 

10.1  Suppose  you  wake  up  in  Rotterdam  (10  km  from  Delft)  1  minute  before  class  in  Delft  starts  (i.e.,  at  8:44). 
You  dress  quickly,  but  have  only  10  seconds  left  to  get  to  Delft.  You  happen  to  be  a  very  fast  biker,  making 
it  exactly  when  your  watch  shows  8:45.  Will  the  rest  of  the  class  agree  that  you  arrived  just  in  the  nick  of 
time? 

10.2  A  well-known  (but  probably  apocryphal)  Einstein  quote  is  ‘Sit  on  a  hot  stove  for  five  minutes,  and  it  feels 
like  an  hour.  Talk  to  a  pretty  girl  for  an  hour,  and  it  feels  like  five  minutes.  That’s  relativity.’ 

(a)  Einstein  (at  rest,  frame  S)  sits  on  pins  and  needles  for  five  minutes.  Could  there  be  a  moving  frame 
S'  in  which  this  same  period  lasts  an  hour?  If  so,  determine  the  velocity  of  that  frame  with  respect  to 
S,  if  not,  explain  why  not. 

(b)  Einstein  talks  with  Marilyn  Monroe  for  an  hour.  (According  to  another  well-known  anecdote,  during 
this  conversation  Marilyn  Monroe  would  have  said  to  Einstein  ‘If  we  were  to  have  children,  and 
they’d  have  your  brains  and  my  looks,  wouldn’t  that  be  fantastic?’,  to  which  Einstein  replied  'Yes,  but 
what  if  they’d  have  your  brains  and  my  looks?’).  Both  Einstein  and  Monroe  are  at  rest  in  frame  S. 
Could  there  be  a  moving  frame  S'  in  which  this  same  period  lasts  five  minutes?  If  so,  determine  the 
velocity  of  that  frame  with  respect  to  S,  if  not,  explain  why  not. 

10.3  How  fast  would  you  have  to  fly  such  that  you  cover  exactly  one  lightyear  (as  measured  by  a  stationary 
observer)  in  one  year  (as  measured  on  your  clock)? 

10.4  Adding  velocities  Einstein  postulated  that  the  speed  of  light  (in  vacuum,  but  we’ll  ignore  that  point)  is 
the  same  in  any  inertial  reference  frame.  Consequently,  for  any  object  with  mass,  the  speed  of  light  is 
also  the  absolute  limit:  you  can  never  reach  it,  let  alone  exceed  it.  That  doesn’t  fit  well  with  everyday 
experience:  if  you’re  on  a  train  moving  at  speed  u,  and  throw  a  ball  at  speed  v,  an  outside  (stationary) 
observer  will  measure  the  ball’s  speed  to  be  u  +  v.  There  seems  to  be  no  fundamental  reason  why  we 
couldn’t  take,  say,  u-v-  0.8c,  which  would  imply  that  the  outside  observer  measures  the  ball  to  go 
at  1.6c.  The  strange  thing  is  that  (s)he  doesn’t  -  the  outside  observer  will  tell  you  that  they  measured  a 
speed  of  only  0.976c.  The  reason  is  that  although  both  you  and  the  outside  observer  consider  speed  to 
be  the  distance  traveled  divided  by  the  time  it  took  to  travel  that  distance,  you  no  longer  measure  either 
the  same  distance  or  the  same  time  at  such  high  speeds.  In  this  exercise,  we’ll  derive  a  new  equation  for 
adding  speeds,  which  shows  that  you  can  never  break  the  speed-of-light  limit. 

We  consider  the  situation  sketched  in  figure  .  A  train  is  moving  with  speed  u.  Someone  standing 
at  the  back  of  the  train  car  fires  a  photon  (yellow,  speed  c)  and  a  bullet  (blue)  at  the  same  time.  As 
measured  on  the  train,  the  bullet  has  speed  u;  for  an  outside  observer,  the  bullet  has  speed  w.  The 
photon  is  reflected  by  a  mirror  at  the  front  of  the  car,  and  meets  up  with  the  bullet  at  a  point  a  distance  (f>L 
from  the  front.  Note  that  L,  the  length  of  the  car,  is  measured  differently  by  the  comoving  and  stationary 
(outside)  observer,  as  is  the  time  interval  between  firing  and  meeting  of  the  photon  and  bullet.  However, 
both  observers  agree  that  at  the  events  of  firing  and  meeting  the  photon  and  bullet  are  at  the  same  place 
in  space  and  time.  We’ll  use  these  two  points  to  find  a  relation  between  the  speeds  of  the  bullet  as 
observed  in  the  stationary  and  comoving  frame,  and  the  speed  of  the  train. 

(a)  First,  you’ll  take  the  point  of  view  (and  spacetime  coordinates)  of  the  outside  observer.  Denote  the 
length  of  the  train  (as  observed  by  you)  by  L,  the  time  it  takes  the  photon  to  reach  the  front  of  the 
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train  by  t\,  and  the  time  it  takes  to  return  to  the  meeting  point  by  t-^.  The  train  moves  at  speed  u,  the 
observed  velocity  of  the  bullet  is  w  and  that  of  the  photon  is  (obviously)  c.  Express  ct\,  the  distance 
traveled  by  the  photon  in  fi ,  in  terms  of  L,  u,  and  t\ . 

(b)  Now  express  ct2,  the  distance  traveled  by  the  photon  in  t2,  in  terms  of  L,  u,  t2,  and  </>. 

(c)  From  the  two  equations  you  have,  eliminate  L  and  then  rewrite  the  resulting  equation  in  an  expres¬ 
sion  for  f 2  /  T 1  ■ 

(d)  Argue  that  w{t\  +  £2)  =  c(t\  —  t2)  by  considering  where  the  bullet  and  photon  meet  up. 

(e)  Rewrite  the  expression  in  (  )  to  also  give  an  expression  for  ^/fl. 

(f)  Equate  the  expressions  you  found  in  (  )  and  (  ),  then  rewrite  them  as  an  expression  for  <p. 

(g)  Now  we’ll  take  the  comoving  point  of  view.  Find  another  expression  for  <p.  Hint:  you  can  either 
repeat  the  procedure  above  for  the  comoving  observer,  or  ‘translate’  the  expression  in  ( If)  directly 
to  the  comoving  frame. 

(h)  Equate  the  two  expressions  for  cf>  to  get  an  expression  for  w  that  only  contains  u,  v  and  c. 

(i)  The  rest  is  algebra:  show  that  the  expression  you  found  in  (  )  can  be  rewritten  as: 


w 


r  ■  II 
1  +  uvlc2 


(10.4) 


10.5  [Challenging]  Three  bars  are  welded  together  into  the  shape  of  an  isosceles  triangle  with  side  ratios  5:5:6. 
An  observer  looks  at  our  triangle  from  a  direction  perpendicular  to  the  plane  it  spans,  as  in  figure  a.  The 
observer  moves  with  relativistic  velocity  v  in  a  direction  parallel  to  the  plane  of  the  triangle. 


(a)  In  which  direction  and  at  which  speed  should  the  observer  be  moving  so  that  according  to  him,  the 
triangle  is  equilateral? 

(b)  We  now  rotate  the  triangle  in  such  a  way  that  all  its  corners  and  the  observer  are  in  one  plane.  The 
longest  side  of  the  triangle  is  arranged  perpendicular  to  the  observer’s  line  of  sight,  and  at  the  far 
end,  as  in  figure  b.  The  observer  moves  with  speed  v  (not  necessarily  the  same  speed  as  in  part  (a) 
in  the  direction  parallel  to  this  long  edge.  Show  that  the  observer  sees  a  distorted  triangle. 

(c)  As  the  observer  keeps  moving  along  the  same  line  as  in  (b) ,  show  that  it  seems  to  him  as  if  the  triangle 
is  rotating,  and  determine  in  which  direction  it  rotates. 


11 

Lorentz  transformations 


As  we’ve  seen  at  the  end  of  section  10.1 ,  the  light  postulate  implies  that  there  is  a  truly  universal  speed  limit: 
nothing  can  move  faster  than  the  speed  of  light.  This  statement  doesn’t  fit  very  well  with  the  classical  me¬ 
chanics  world  you’re  used  to.  After  all,  if  you’re  on  a  train  moving  at  speed  u,  and  throw  a  ball  at  speed  v,  a 
(stationary)  observer  on  the  platform  will  measure  the  ball’s  speed  to  be  u  +  v.  You  may  not  be  able  to  move 
at  the  speed  of  light,  but  there  is  no  fundamental  reason  why  the  train  could  not  move  at,  say,  u  —  0.8c,  and 
similarly  no  reason  why  you  couldn’t  throw  the  ball  at  v  -  0.8c  either.  However,  as  we’ll  see  below,  in  this  case 
the  outside  observer  does  not  measure  the  ball’s  speed  to  be  1.6c,  but  ‘only’  0.976c,  so  less  than  the  speed  of 
light.  The  reason  is  that  the  simple  speed  addition  equation  t'cmtside  -  u  +  v  does  not  hold  when  u  or  v  gets 
close  to  c,  another  consequence  of  the  light  postulate.  We’ll  derive  the  correct  equation  for  velocity  addition 
in  this  section  using  one  method,  and  in  problem  0.2 A  you  can  do  it  yourself  using  another  method,  based 
on  an  extension  of  the  train  example. 


11.1.  Classical  case:  Galilean  transformations 

To  figure  out  how  velocities  add  in  our  new  reality  set  by  the  light  postulate,  we  need  to  reconsider  the  world¬ 
view  of  a  stationary  and  moving  observer,  each  in  their  own  inertial  reference  frame.  In  classical  mechanics, 
for  an  observer  moving  at  speed  u  in  the  x-direction,  we  can  find  the  coordinates  of  this  observer’s  reference 
frame  with  respect  to  those  of  a  stationary  observer  using  a  simple  set  of  transformation  rules: 


X  —  x— ut, 

(11-la) 

/  =  y. 

(11.1b) 

Z  =  Z, 

(11.1c) 

t'  =  t. 

(11- Id) 

Here  the  primed  variables  denote  the  coordinates  of  the  moving  observer,  and  the  unprimed  variables  the 
stationary  ones.  We’ll  call  the  stationary  frame  S,  and  the  moving  frame  S'.  Of  course  we  could  also  express 
the  coordinates  of  S  in  those  of  S'  -  that’s  just  equation  (11  )  with  the  sign  of  u  flipped.  Note  that  we  included 
the  observation  that  time,  as  measured  by  both  observers,  is  the  same,  as  well  as  the  y  and  z  coordinates 
(since  the  train  moves  in  the  x  direction  -  and  we  can  just  pick  the  x  direction  to  be  the  one  the  train  moves 
in).  Equation  (11.  )  is  known  as  the  Galilean  coordinate  transformation.  Note  that  it  fits  with  the  classical 
statement  that  accelerations  are  the  same  as  measured  in  any  reference  frame: 


2x'  d2[x-ut)  d  /  dx  )  d“x 

d(f')2  df2  df  1  dt  J  df 2 


11.2.  Derivation  of  the  Lorentz  transformations 

Instead  of  constant  acceleration,  in  the  theory  of  relativity  we  have  a  constant  speed  of  light  in  each  inertial 
reference  frame.  That  means  that  the  transformation  rules  (  )  change.  One  thing  that  will  not  change  is 

that  spatial  directions  in  which  there  is  no  motion  are  measured  the  same  by  all  observers  (our  observers 
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Hendrik  Antoon  Lorentz  (1853-1928)  was  a  Dutch  physicist,  considered 
widely  as  the  leading  theoretical  physicist  of  his  time.  At  age  24,  Lorentz 
became  a  professor  at  Leiden  university  where  he  initally  worked  on  elec¬ 
tromagnetism.  He  provided  the  theoretical  explanation  for  the  recently 
discovered  (quantum-mechanical)  Zeeman  effect,  for  which  he  and  Zee- 
man  shared  the  Nobel  prize  in  1902.  Around  1900,  Lorentz  developed  the 
set  of  transformations  now  named  after  him  in  an  attempt  to  interpret  the 
results  of  the  Michelson-Morley  experiment.  As  Einstein  built  the  theory 
of  relativity  on  the  mathematical  tools  provided  by  Lorentz,  it  was  origi¬ 
nally  referred  to  as  the  Lorentz-Einstein  theory;  Lorentz  himself  quickly 
appreciated  Einstein’s  insights  and  consistently  referred  to  ‘Einstein’s  prin¬ 
ciple  of  relativity’.  Lorentz  resigned  from  his  position  in  Leiden  in  1912  to 
have  more  time  to  do  research,  moving  to  the  Teylers  museum  in  Haarlem 
(still  open  today);  Lorentz’  successor  in  Leiden,  Paul  Ehrenfest,  founded 
an  institute  for  theoretical  physics  there  that  is  now  known  as  the  Lorentz 
institute.  From  1918  till  1926  Lorentz  focussed  his  efforts  on  maritime 
engineering,  as  chair  of  the  committee  charged  with  designing  the  Afs- 
luitdijk,  a  32  km  dike  that  closes  off  the  former  Zuiderzee  in  the  north 
of  the  Netherlands.  Lorentz  solved  the  various  necessary  hydrodynamic 
problems  numerically  by  hand,  one  of  the  first  engineering  problems  ap¬ 
proached  in  this  way;  when  construction  was  finished,  it  turned  out  that 
his  calculations  had  been  highly  accurate.  One  of  the  two  sets  of  locks  in 
the  dyke  is  named  after  him. 


Figure  11.1:  Hendrik  Antoon 
Lorentz,  around  1916  [28]. 


are  after  all  both  stationary  in  the  y  and  z  directions),  so  we’ll  only  consider  the  x  direction,  and  time.  To 
treat  space  and  time  on  an  equal  footing,  they  should  of  course  have  the  same  dimensions.  Fortunately,  the 
one  universal  constant  in  special  relativity,  the  speed  of  light  c,  converts  a  time  to  a  space  (or  vice  versa), 
so  we'll  consider  a  transformation  on  position  x  and  ‘time’  ct.  A  second  thing  that  won’t  change  is  that  the 
transformations  have  to  be  linear.  If  they  were  not,  they  would  violate  the  principle  of  relativity,  because  then 
the  length  of  an  object  (or  of  a  time  interval)  would  depend  on  the  choice  of  origin  of  our  coordinate  frames 
S  and  S'. 

For  a  general  linear  transformation,  we  write: 


ct 


=  A 


x 

ct 


with 


A  = 


a\ i  a\2 
a21  a22 


(11.3) 


We  want  our  transformation  to  be  invertible,  so  det(A)  ^  0  and 


(  x  \  1  I  a22  -au  W  x'  ) 

1  ct  )  det(A)  v  ~a2i  an  A  ct'  )' 


(11.4) 


We  can  find  the  coefficients  of  A  by  simply  demanding  that  S'  moves  relative  to  S  at  constant  speed  u,  and 
the  value  of  c  is  the  same  in  S  and  S’.  To  start  with  the  first  condition,  consider  a  stationary  point  in  S',  so 
x!  —  b' .  From  (  )  we  have  x'  -  a\\x+  ai2ct,  or  x  =  [x'  -  a\2Ct)l a\\.  On  the  other  hand,  in  S  this  point 

moves  at  speed  u,  so  the  same  point  is  described  by  an  equation  of  the  form  x-  u  t  +  xq  .  Therefore,  we  have 
-a\2c! a\\  -  u.  Naturally,  we  can  also  consider  a  stationary  point  in  S,  x  =  b  =  (a22x!  -  a\2ct')l  det(A),  which 
is  moving  at  speed  —u  in  S',  so  x'  —  -ut'  +  x'Q,  which  gives  -u  =  a^cl  a22-  It  follows  that  a22  =  -a\2clu  -an, 


The?  derivation  that  follows  is  not  mathematically  difficult  (these  are  all  linear  equations,  after  all),  but  it  contains  a  fairly  large  number 
of  steps.  The  easiest  way  to  get  your  head  around  them  is  to  take  a  piece  of  paper  and  do  them  yourself. 


11.2.  Derivation  of  the  Lorentz  transformations 


117 


and  we  can  rewrite  our  transformations  (  1 .3)  and  (  )  as: 


/  #12 

x  =  a  11  (xh - ct)  =  fln(x-  ut), 

an 

(11.5a) 

!  #21 

Ct  —  fl21*+  a22Ct  —  fln( - x+ct), 

an 

(11.5b) 

^11  /  f 

X  =  - (x  +  ut ), 

det(A) 

(11.5c) 

tin  1  tl2\  ,  ,\ 

ct  - - - x  +  cr  . 

det(A)  l  an  1 

(11. 5d) 

Note  that  with  an  —  1  and  roi  =  0,  these  are  simply  the  Galilean  transformations  again.  We’ll  allow  a2i  ^  0  to 
accommodate  for  the  light  postulate.  To  see  how  that  works,  we  first  calculate  the  velocity  of  a  moving  object 
in  either  reference  frame,  and  relate  them  to  each  other: 


,  _  Ax'  a\\A{x-ut )  dx-udt 

dr'  an  d{{a2\l  an){xl  c)  +  t)  {U2\l  ca\\)dx+ dt 

dxldt-u  v-u 

1  +  (fl2i/cfln)dx/dr  1  +  {a2\l  an){vl  c)’ 

where  we  used  v  =  dxl  dt.  Inversely,  we  have: 

dx  v1  +  u 

dr  1  -  [a2\! a\\)(v' I c) " 


(11.6) 


(11.7) 


As  the  light  postulate  states,  it  doesn’t  matter  if  we  measure  c  in  S  or  S',  we  always  get  the  same  number. 
So  for  light,  we  should  have  v'  —  c  —  v,  which  we  can  use  in  equation  (  1.6)  to  determine  a2\la\\. 


The  transforms  now  become: 


c—u 

1  +  (fl2l/flll) 


SO 


#21  ll 

#11  C 


(11.8) 


x!  —  an(x  —  ut), 
ux\ 
c 


,  (  UX\ 

ct  =  an  |cr - J , 


1 


1 


x  - 


ct- 


a\\  1  -  u2lc2 
1  1 
an  1  -  u2lc2 


{x'  +  ut'), 


ct  +  ■ 


(11.9a) 

(11.9b) 

(11.9c) 
(11. 9d) 


In  equations  (  )  we  used  det(A)  =  al{]  (1  -  u2/c2).  We  are  left  with  one  undetermined  parameter,  the  value 

of  a\  1 .  We’ll  use  it  to  make  the  transformation  symmetric  -  after  all,  we  could  have  started  with  S'  as  stationary 
and  S  as  moving  (with  speed  -u),  and  we  should  get  the  same  transforms,  except  for  the  sign  of  u.  Equating 
the  prefactor  in  equations  (  )  and  (  ),  we  find  that  an  =  y(w),  with  j{u)  again  defined  as 


y(w)  = 


1 

yjl  -  ( ulc )2 


(11.10) 


Note  that  j{u)  —  j(-u),  in  accordance  with  the  earlier  notion  that  it  doesn’t  matter  whether  you  are  in  S 
watching  S'  move  at  u,  or  in  S'  watching  S  move  at  -  u.  We  have  now  arrived  at  the  Lorentz  transformations : 


x  =  y(z<)  (x-  ut), 

(11.11a) 

ct  —  y(zr)  - J , 

(11.11b) 

x  =  y(w)  (x'  +  ut') , 

(11.11c) 

(  ,  ux! ) 

(11. lid) 

ct  =  j[u)  ct  + -  . 

V  c  I 

The  Lorentz  transformations  transform  both  space  and  time.  Consequently,  our  two  observers  do  not 
only  measure  space  differently,  as  in  the  classical  system  (recall  the  stationary  and  comoving  coordinates), 
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but  they  also  measure  time  differently!  For  small  speeds,  y(zr)  is  (very)  close  to  one  and  the  effect  negligible, 
but  for  high  speeds  it  certainly  is  not.  As  we  have  already  seen  in  the  previous  section  based  on  the  train 
argument,  and  see  again  below,  these  different  time  measurements  lead  to  potentially  confusing  results:  the 
two  observers  no  longer  agree  on  which  events  are  simultaneous,  how  long  a  meter  stick  is,  or  how  long  it 
took  to  travel  from  one  place  to  another. 

Before  going  to  the  applications,  we  have  a  few  closing  remarks  about  the  Lorentz  transformations.  First, 
we  put  in  some  effort  to  make  the  transformation  symmetric  between  going  from  S  to  S'  and  vice  versa.  We 
can  do  more  though.  Since  time  and  space  now  both  transform,  and  get  mingled  up  in  the  transformation,  it  is 
no  longer  appropriate  to  separate  them;  instead,  we’ll  consider  a  combined  system  of  four  dimensions  known 
as  spacetime.  As  proper  physicists,  we  should  however  not  compare  apples  and  oranges,  or  time  and  space. 
We  already  converted  the  time  coordinate  to  a  space  coordinate  by  multiplying  it  with  c.  In  equations  (  ) 

and  (  11c)  we  canceled  that  c  in  front  of  t  with  a  c  in  the  denominator,  but  it  is  cleaner  to  put  it  back,  so  we 

get  an  even  better  sense  of  the  equality  of  space  and  time  in  the  Lorentz  transformation: 

x'  =  y(z/)(x- {ul c)ct) ,  (11.12a) 

ct'  =  y{u){ct- [ul c)x) .  (11.12b) 

Note  that  in  equations  (  12)  the  velocity  u  only  appears  as  a  fraction  of  c:  we  only  have  expressions  of  the 

form  ul  c,  making  all  the  coefficients  of  our  transforms  nicely  dimensionless. 

Second,  all  equations  in  this  section  are  for  a  transformation  between  a  stationary  frame  and  one  moving 
in  the  positive  x-direction  with  speed  u.  Since  we’re  in  principle  free  to  choose  our  coordinates,  we  can  always 
re-label  or  construct  our  axes  to  match  this  setup.  In  practice,  that  may  not  always  be  handy  though,  so  we 
could  also  consider  movement  into  a  different  direction.  Of  course,  moving  in  either  the  y  or  z  direction  just 
makes  those  axes  swap  with  the  x  axis  considered  here,  so  we  won’t  bother  to  explicitly  write  down  those 
transformations.  We  can  also  write  down  the  transformation  for  movement  in  a  general  direction  u: 

,  ux  r{u) 

x  —  x  +  (y(«)  -  1) - u - uct, 

u-  u  c 
,  (  ux\ 

ct  —  7  (w)  [ct - J , 

where  u-\u\  is  the  speed  of  the  moving  frame,  and  x  =  [x,  y,  z). 

Finally,  we  note  that  the  collection  of  Lorentz  transformations  in  the  x  direction  form  a  group  under 
composition.  If  a  system  S'  moves  with  respect  to  S  with  velocity  u,  and  S"  moves  with  respect  to  S'  with 
velocity  v,  then  you  can  make  a  Lorentz  transformation  from  S  to  S'  and  from  S'  to  S",  but  also  from  S  to  S" 
direcdy3.  As  you  can  check  for  yourself  (Problem  .  ),  the  transformation  from  S  to  S"  is  indeed  another 
Lorentz  transformation.  The  velocity  of  S"  with  respect  to  S  is  again  not  u+v,  but  ( u  +  t/)/(l  +  uvlc2),  as  given 
by  the  relativistic  velocity  addition  equation  (  )  derived  below. 

11.3.  Some  consequences  of  the  Lorentz  transformations 

11.3.1.  MSS  OF  SIMULTANEITY 

If  two  events  are  simultaneous  for  a  moving  observer  in  S',  the  observer  measures  their  time  interval  as  At'. 
If  the  two  events  happen  at  the  same  position  (Ax'  =  0),  the  Lorentz  transformations  give  Ax  =  0,  At  =  0 
as  well.  However,  if  the  two  events  in  S'  are  spatially  separate  (Ax'  f  0),  we  find  that  for  an  observer  in  S 
At  =  y{ii){ul  c2)Ax' ,  and  therefore  the  two  events  are  not  simultaneous.  Things  even  get  worse:  suppose 
two  events  A  and  B  happen  in  S  a  distance  Ax  apart,  and  a  time  interval  At  after  each  other.  Now  if  Ax  > 
{cl  u)c\t,  a  moving  observer  in  S'  will  conclude  that  At'  <  0,  which  means  that  event  B  happens  before  event 
A!  Fortunately,  this  does  not  violate  causality,  as  a  signal  from  A  to  B  (or  vice  versa)  will  at  most  travel  with  the 
speed  of  light,  which,  as  we  will  see  in  the  next  section,  means  that  for  the  conditions  given,  A  and  B  cannot 
be  causally  connected  -  i.e.,  you  cannot  reverse  cause  and  effect,  no  matter  how  fast  you  run. 


2  These  transformations  in  a  given  direction  are  sometimes  also  referred  to  as  Lorentz  boosts. 

3Think,  for  example,  of  S  as  a  stationary  platform,  S'  as  a  moving  train,  and  S"  as  a  toy  train  in  the  moving  train. 
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11.3.2.  Time  dilation  and  Lorentz  contraction 

A  stationary  observer  in  frame  S'  measures  the  time  difference  between  two  points  to  be  At'  on  his/her  own 
clock,  while  an  observer  in  S  will  measure  the  time  difference  on  that  (moving)  clock  to  be  At  =  y{u)At' , 
exactly  the  time  dilation  result  we  found  in  equation  (10.  ).  Likewise,  an  observer  in  S'  will  measure  the 
length  of  a  stationary  stick  to  be  A L' .  For  an  observer  in  S,  using  a  method  that  reaches  the  ends  of  the 
stick  simultaneously  (so  At  =  0),  the  length  is  A L.  We  have  Ax'  =  A L'  =  j[u)AL,  so  A L  =  AL' which 
(unsurprisingly)  is  the  Lorentz  contraction  result  of  equation  (  0.3). 


11.3.3.  Velocity  addition 

We  calculated  the  speed  of  an  object  v  as  measured  in  S  as  a  function  of  the  speed  v'  in  S'  and  the  speed  u  of 
S'  in  equation  (  ).  Substituting  the  values  of  the  constants  we  found  later,  we  get  the  following  equation: 


u  +  v' 

1  +  uv'  lc2 


(11.14) 


Equation  (  )  thus  follows  directly  from  the  light  postulate  -  that  is  all  we  used  to  derive  it.  It  mathemati¬ 

cally  shows  you  can  never  add  velocities  in  such  a  way  as  to  exceed  the  speed  of  light.  Setting  u-  v'  =  c  gives 
v  =  c,  and  for  any  values  u  <  c,  v'  <  c,  you’ll  always  get  v  <  c. 

Equation  (  )  holds  for  motion  in  the  same  direction  as  the  motion  of  the  reference  frame  -  for  example, 

if  you’re  on  a  moving  train,  and  rolling  a  ball  down  the  length  of  the  train.  However,  you  could  also  roll  the 
ball  in  the  transverse  direction  (say  y  if  we  call  the  direction  in  which  the  train  is  moving  x).  You  might 
think  that  the  observed  velocity  for  the  comoving  and  stationary  observer  is  the  same  in  that  case  (it  is  for 
Galilean  transformations),  but  that’s  not  the  case.  We  have  vy  -  Ay  I  At,  and  although  dy  is  invariant,  At  is 
not.  Calculating  vy  in  terms  of  v'y  (the  speed  at  which  the  moving  observer  rolls  the  ball)  is  straightforward 
though,  we  simply  apply  the  Lorentz  transformation  to  df: 


dy  dy'  1  Ay' I  At'  1  v'y 

y  At  y(w)d(r'  +  jsx')  y(w)  1+  jjAx'/dt'  y(u)  1  +  uv'x/c2 


(11.15) 


11.3.4.  Example  application:  relativistic  headlight  effect 

Suppose  you  have  a  light  source  that  radiates  isotropically  (i.e.,  with  the  same  intensity  in  all  directions).  What 
happens  if  we  put  the  light  source  on  a  moving  train?  Remarkably,  according  to  a  stationary  observer,  the  light 
source  is  not  isotropic  anymore.  To  understand  what  happens,  let  us  as  usual  call  the  direction  in  which  the 
train  moves  x  and  its  speed  u.  A  ray  of  light  emitted  by  the  light  source  in  S'  has  a  velocity  v'  with  magnitude  c 
(on  which  both  observers  agree),  and  components  {v'x,  v'y,  v'z)  (figure  a).  Now  let’s  consider  the  ray  of  light 
that  moves  along  the  y'  axis.  Its  velocity  is  given  by  v'  =  (0,  c,0).  We  can  calculate  the  velocity  components 
of  this  ray  of  light  in  frame  S  using  the  velocity  transformation  equations  (  )  and  (  15),  which  gives 

v  =  [u,  c/y(u),  0)  (which  of  course  still  has  magnitude  c).  The  ray  of  light  thus  picks  up  a  component  in  the 
positive  x  direction,  and  consequently  gets  a  smaller  component  in  the  y  direction.  Figure  ,2b  shows  the 
resulting  light  cone  in  the  positive  x  direction.  Its  opening  angle  6  can  be  easily  calculated: 

Vy  Vy 

sin  m  =  -r-  =  —  =rM-  (11.16) 

\v\  c 

For  reasons  that  you  will  probably  find  easy  to  guess,  this  phenomenon  is  known  as  the  relativistic  headlight 
effect.  It  is  observed  in  the  radiation  emitted  by  electrons  rotating  around  magnetic  held  lines  orbiting  Jupiter 
and  the  sun,  as  well  as  in  particle  accelerators  on  earth. 


11.4.  RAPDITY  AND  REPEATED  LORENTZ  TRANSFORMATIONS* 

As  stated  at  the  end  of  section  2,  the  composition  of  two  Lorentz  transformations  is  again  a  Lorentz  trans¬ 
formation,  with  a  velocity  boost  given  by  the  ‘relativistic  addition’  equation  (  )  (you’re  asked  to  prove  this 

in  problem  .  ).  You  could  of  course  repeat  this  process  for  successive  transformations,  but  the  repeated 
addition  of  velocities  quickly  leads  to  impractical  expressions.  You  could  also  investigate  whether  the  combi¬ 
nation  of  a  Lorentz  transformation  in  the  x  direction  and  one  in  the  y  direction  again  gives  a  Lorentz  trans¬ 
formation.  The  answer  is,  in  general,  no:  it  is  the  combination  of  a  Lorentz  transformation  and  a  rotation. 
In  some  sense,  we  can  also  consider  Lorentz  transformations  themselves  as  ‘rotations’  in  (4-dimensional) 
spacetime.  We’ll  discuss  spacetime  in  more  detail  in  the  next  two  sections.  Here,  we’ll  work  out  a  different 
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a)  y'  b)  y 


Figure  11.2:  Relativistic  headlight  effect,  (a)  Isotropic  light  source  in  the  comoving  frame  S',  (b)  The  same  light  source  as  observed  from 
a  stationary  frame  S.  The  opening  angle  8  is  given  by  sin(0)  =  y{u). 


way  of  writing  the  Lorentz  transformations  that  shows  their  relation  to  rotations.  As  a  bonus,  it  will  allow  us  to 
easily  calculate  the  speed  of  the  nth  Lorentz  transformation  (starting  from  rest,  all  in  the  positive  x  direction). 

Let  us  again  write  the  Lorentz  transformation  as  a  matrix.  Using  the  y(  u)  factor  and  introducing  (H  u)  = 
ul  c,  we  have 


ct 


=  T(m) 


1  P 
P  1 


(11-17) 


again  beautifully  illustrating  the  symmetry  between  time  and  space.  We  now  define  the  rapidity  <p  by 


We  then  have 


tanh(rf>)  =  h{u)  -  — . 

c 


y{u)  = 


1 

\/l  -p{u)2 


y(u)p{u)  = 


P{u) 

n/1  -Piu)2 


- =  cosh  <p, 

y/l  -  tanh2  cf> 

tanh  <p 

—  —  =  sinh(/>. 

y/l  -  tanh2  (p 


(11.18) 


Substituting  these  expressions  back  into  the  Lorentz  transformations  (  ),  we  get 


I  x  \  _i  cosh (p  sinh0  W  x'  ) 
\  ct  j  ^  sinh0  cosh0  j\ct'j 


which  closely  resembles  the  expression  for  a  rotation. 

We  can  likewise  rewrite  the  equation  for  velocity  addition  in  terms  of  the  rapidity.  Suppose  we  want  to 
add  velocities  u  and  v,  let  the  resulting  velocity  be  w,  then: 


tanh(0u,)  =  — 
c 


ulc+  vie 
1  +  uvlc2 


Pu  +  Pv 
1  +  PuPv 


tanh(0„)  +  tanh(0„) 
1  +  tanh((/>„)  tanh(0„) 


=  tanh  {(pu  +  (pv ) 


(11.20) 


4If  you’re  unfamiliar  with  the  hypergeometric  functions:  they’re  defined  like  the  trigonometric  functions  as  combinations  of  powers  of  e, 
except  that  we  drop  the  complex  number  i,  so  we  have 


sinh(<)>)  =  -  - 


-<!>] 


cosh  (</>)  =  -[e^-re 


,  tanh(</>)  = 


sinh(0)  -  e 


cosh  (0)  e'l’  + 

Note  that  d sinh 0/  cT/>  =  cosli 0  like  the  trigonometric  counterpart,  but  dcosh (pi  dip  =  sitili0,  so  no  minus  sign  in  this  case. 
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(in  problem  .9  you  get  to  prove  the  last  equality) .  We  thus  find  a  very  simple  addition  rule  for  the  rapidities: 


<pw  =  <pu  +  <pv- 


(11.21) 


Suppose  now  that  we  have  a  (stationary)  reference  system  S,  a  system  S'  that  moves  with  speed  u  (and  rapid¬ 
ity  <p)  with  respect  to  S,  a  system  S"  that  moves  with  speed  u  with  respect  to  S',  and  so  on.  By  equation  (  ), 

the  system  S(";i  then  moves  with  rapidity  </>„  =  mp  with  respect  to  S.  To  find  the  relative  speed  un  at  which 

SW 

moves  in  S,  we  invert  the  definition  of  the  rapidity,  which  gives  us 


l+l 

l-P 


(11.22) 


so  un  is  given  by 


un  — 


M 

i+( 


1  -u/c\ 
1+ ul  c  ) 
1  -u!c\ 
1+u/cJ 


n 


n 


C. 


(11.23) 


Note  that  equation  (  1.23)  provides  another  proof  of  the  statement  that  no  inertial  object  can  move  at  the 
speed  of  light:  for  arbitrary  large  values  of  n,  un  remains  less  than  c,  so  no  matter  how  many  velocity  boosts 
you  give  your  massive  particle,  you  can  never  make  it  move  at  the  speed  of  light,  let  alone  exceed  that  speed. 
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11.5.  Problems 

11.1  Suppose  system  S'  moves  with  respect  to  S  with  velocity  u,  and  S"  moves  with  respect  to  S'  with  velocity 
v.  Show  that  system  S"  and  S  are  related  through  a  Lorentz  transformation  with  velocity 

u+v 


11.2  Equation  (  )  gives  the  Lorentz  transformation  for  an  observer  moving  in  an  arbitrary  direction  u. 

(a)  Show  that  if  a  ray  of  light  is  emitted  in  the  rest  frame  S  in  an  arbitrary  direction,  such  that  its  trajec¬ 
tory  is  described  by  x  -  ct  (where  c-c-c2),  then  in  the  moving  frame  S',  the  trajectory  of  the  ray  of 
light  is  also  a  straight  line,  given  by  x'  =  c' t' ,  with  c'  ■  c'  =  c2. 

(b)  What  is  the  direction  of  the  ray  of  light  in  S’? 

11.3  (a)  A  male  and  a  female  student  both  attend  lectures  on  relativity.  Afterwards,  they  return  home  by 

train,  moving  in  opposite  directions,  each  at  (4/5)  c.  Before  they  left,  they  promised  to  send  each 
other  messages  while  on  the  train.  Unfortunately  there  is  interference  in  the  phone  network,  so 
they  can’t  use  their  phones  to  do  so.  They  do  have  pen  and  paper  though,  so  they  could  write  down 
their  message  and  throw  it  to  the  other  train.  The  boy,  having  only  paid  attention  during  the  hi  st  part 
of  the  lecture  (being  distracted  by  the  girl  afterwards)  remembers  that  nothing  can  go  faster  than  the 
speed  of  light,  and  concludes  that  throwing  something  is  pointless,  as  the  (classical)  relative  speed 
of  the  trains  exceeds  the  speed  of  light.  The  girl,  who  paid  attention  throughout,  realizes  this  is  not 
the  case,  and  explains  why  in  her  note  to  the  boy.  At  which  minimum  velocity  should  she  throw  the 
note  so  that  it  can  reach  the  boy’s  train? 

(b)  The  boy,  having  received  the  girl’s  message,  realizes  that  he  has  a  much  better  chance  of  completing 
the  assignments  if  he  can  ask  her  some  more  questions.  He  therefore  leaves  his  train,  and  takes 
the  next  one  back  (so  now  traveling  in  the  same  direction  as  the  girl).  Unfortunately,  this  train  only 
moves  at  (3/ 5)  c,  so  it  won’t  catch  up  with  the  girl’s  train.  The  boy  consoles  himself  with  the  thought 
that  relative  to  him,  the  girl  is  moving,  so  her  clock  is  running  slow,  and  at  least  she  won’t  have 
forgotten  about  him  by  the  time  she  leaves  her  train.  An  hour  passes  on  the  boy’s  watch.  How  much 
time  (according  to  him)  has  the  girl’s  watch  advanced  in  that  period  (assuming  they  both  stay  on 
their  trains)? 

1 1 .4  An  observer  on  Earth  sees  two  spaceships  (or  trains,  whatever  you  prefer)  approaching  from  opposite  di¬ 
rections.  The  observer  measures  their  velocities  in  his/her  own  rest  frame,  and  not  knowing  about  rela¬ 
tivity,  uses  Galilean  velocity  addition  to  conclude  that  the  two  spaceships  have  a  relative  speed  of  (7/5)  c. 
However,  an  observer  on  one  of  the  spaceships  measures  the  relative  speed  of  the  other  as  (35/37 )c.  Find 
the  speeds  of  the  two  spaceships  relative  to  the  Earth. 

11.5  A  male  and  a  female  student  both  attend  relativity  lectures.  The  boy  however  is  more  interested  in  the 
girl  than  in  the  lecture.  Frustrated,  the  teacher  throws  a  wet  sponge  towards  him,  at  speed  c/2.  The  girl, 
hoping  to  save  the  boy,  tries  to  intercept  the  sponge,  throwing  her  marker  at  it  from  the  side  (making  a 
right  angle,  i.e.,  coming  at  the  sponge  from  a  direction  perpendicular  to  its  direction  of  motion,  as  seen 
in  the  (stationary)  reference  frame  of  the  lecture  hall)  with  speed  c/3.  For  a  spider  who  happens  to  sit 
on  the  sponge,  what  is  the  measured  speed  of  the  marker? 

11.6  A  spaceship  flies  away  from  Earth  with  speed  cl 3>  After  some  time  a  small  shuttle  departs  from  the 
spaceship,  in  a  direction  that  makes  a  right  angle  with  that  of  the  main  ship,  and  a  speed  of  c/4,  as 
measured  in  the  rest  frame  of  the  main  ship.  What  are  the  magnitude  and  direction  of  the  velocity  of  the 
shuttle  as  measured  from  Earth? 

11.7  Angles  in  Lorentz  transformations 

(a)  A  rod  moves  with  velocity  v  in  a  straight  line  relative  to  an  inertial  frame  S.  In  its  rest  frame  the  rod 
makes  an  angle  of  O'  with  the  forward  direction  of  its  motion.  Find  the  angle  6  the  rod  appears  to 
make  with  the  direction  of  motion  as  measured  in  the  frame  S.  Determine  the  numerical  value  of 
this  angle  for  6'  -  60°  and  v  -  3c/5. 
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Figure  11.3:  Observing  the  explosion  of  a  quasar  at  large  distance. 


(b)  As  observed  from  an  inertial  frame  S,  a  mirror  is  moving  with  speed  v  in  the  x  direction  with  its 
plane  surface  perpendicular  to  x.  Also  as  observed  in  frame  S,  a  photon  traveling  in  the  xy- plane  is 
incident  on  the  mirror  surface  with  an  angle  8  to  the  mirror’s  normal.  Show  that,  as  seen  from  S,  the 
reflected  photon  has  an  angle  8  with  the  mirror  normal,  where 


cos  8  = 


cost?  +  cos  ct 
1  +  cos  8  cos  a  ’ 


(11.24) 


and  cosa  =  2(Wc)/(l  +  ( vie )2). 

11.8  Quasars  are  active  galactic  nuclei  that  can  emit  large  amounts  of  matter,  usually  gas.  Fortunately  they 
are  far  enough  away  to  never  affect  us,  but  close  enough  that  we  can  measure  e.g.  the  velocity  of  the 
emitted  gas.  These  velocities  occasionally  seem  to  exceed  the  speed  of  light.  To  see  how  this  can  happen, 
consider  a  quasar  a  distance  d  away  from  Earth  (as  measured  in  the  collective  rest  frame  of  the  Earth 
and  the  quasar),  and  an  explosion  on  the  quasar  resulting  in  such  an  emission  at  t  —  0,  see  figure 
The  light  emitted  at  the  moment  of  the  explosion  reaches  Earth  at  t  —  die.  If  the  emitted  gas  is  moving 
on  a  trajectory  like  the  one  shown  in  the  figure  below,  light  emitted  from  the  gas  has  to  travel  a  shorter 
distance  to  Earth  than  the  light  emitted  at  the  moment  of  the  explosion.  Light  emitted  by  an  explosion 
on  a  distant  quasar  reaches  Earth  after  a  time  interval  die.  Light  emitted  from  the  expelled  gas  has  to 
travel  a  shorter  distance.  Note  that  the  distance  traveled  by  the  gas  is  much  smaller  than  the  distance 
between  Earth  and  the  quasar,  so  the  light  arriving  from  both  can  be  taken  as  coming  from  the  same 
direction.  Suppose  that  the  gas  travels  at  speed  v. 

(a)  Determine  the  interval  Afg  between  the  events  ‘Earth  observer  sees  the  initial  explosion’  (which 
happens  at  the  quasar  at  t  —  0,  but  is  observed  later)  and  ‘Earth  observer  sees  the  light  emitted  by 
the  gas  at  time  t\ 

(b)  Determine  the  speed  of  the  gas  an  Earth  astronomer  would  measure  if  they  don’t  take  the  angle  8 
into  account  (we  call  this  speed  i/„bs)- 

(c)  Show  that  the  observed  velocity  can  exceed  the  speed  of  light. 

(d)  Show  that  for  given  actual  velocity  v  of  the  gas,  the  observed  velocity  is  maximized  if  sin0  =  vie,  and 
that  in  that  case  we  get  i>0bs  =  y{v)  v. 

(e)  What  is  the  minimum  speed  v  for  the  gas  at  which  it  can  appear  to  have  a  speed  equal  to  that  of 
light? 
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11.  LORENTZ  TRANSFORMATIONS 


11.9  [For  section  ]  Prove  the  last  equality  in  equation  (  1.2  )  by  expanding  the  hyperbolic  tangents  in  the 
fraction  in  exponential  functions. 

11.10  [For  section  4]  Prove  equation  (  1.23)  by  induction.  If  this  is  the  first  time  you  prove  something  by 
induction:  step  1  is  to  prove  that  the  equation  holds  for  n  -  1  (completely  trivial  in  most  cases) ;  step  2 
is  to  prove  that  if  the  equation  holds  for  all  values  up  to  n,  it  also  holds  for  n  +  1.  In  this  case,  you  thus 
have  to  calculate  un+\  by  (relativistically)  adding  u  and  un. 


12 

Spacetime  diagrams 


To  study  the  effect  of  Lorentz  transformations  on  the  observations  done  by  people  in  different  inertial  refer¬ 
ence  frames,  we  will  make  use  of  a  handy  tool:  the  spacetime  diagram  (figure  12  ,  sometimes  also  called  the 
Minkowski  diagram).  At  first  glance,  these  diagrams  are  very  similar  to  (time,  space)  plots  you’re  probably 
familiar  with,  except  with  the  axes  swapped  (figure  12  a).  We  plot  distance  horizontally,  and  time  (or  rather, 
c  times  time)  vertically.  A  stationary  object  then  has  a  vertical  ‘trajectory’  (known  as  its  worldline ),  and  a 
moving  object  a  trajectory  with  slope  d  v,  with  v  the  object’s  speed.  We  can  go  to  a  comoving  frame  through 
a  Galilean  coordinate  transformation  (figure  b,  equation  ).  The  comoving  frame  S'  has  an  x-axis  x' 
that  coincides  with  that  of  the  stationary  frame,  and  a  time  axis  ct'  that  has  the  same  slope  as  the  speed  of  the 
moving  object.  Consequently,  all  point’s  on  the  object’s  trajectory  have  the  same  space  coordinate,  whereas 
time  keeps  on  running  at  the  same  pace  as  before.  (This  might  seem  counterintuitive  -  didn’t  we  change 
time?  No,  we  rotated  the  time  axis,  but  that  doesn’t  change  time  itself,  as  time  coordinates  are  determined  by 
projecting  on  the  time  axis  parallel  to  the  space  axis.  Space  changed:  points  are  not  projected  on  the  same 
spot  on  the  x  and  x!  axis,  even  though  the  axes  themselves  coincide). 

What  about  the  Lorentz  transformations?  To  study  them,  we  need  to  consider  what  happens  to  the  speed 
of  light,  which  by  the  light  postulate  must  remain  unchanged.  Because  of  our  choice  of  axes  (which  includes 
a  choice  of  units),  light  travels  along  a  line  of  slope  1  in  our  spacetime  diagram  (figure  12.1c).  Anything  slower 
than  light  has  a  trajectory  with  a  slope  larger  than  1.  To  have  a  trajectory  with  slope  less  than  1,  you’d  need  to 
go  faster  than  light.  Therefore,  you  cannot  travel  from  the  origin  into  the  region  closer  to  the  space  than  to  the 
time  axis.  Objects  can  be  there  of  course  -  but  if  you  start  at  the  origin,  you  can’t  travel  there,  or  even  see  these 
objects,  as  not  even  light  can  travel  fast  enough  to  connect  you.  Weil  come  back  to  this  point  in  section  12.2. 
For  now,  we  note  that  the  Lorentz  transform  must  map  the  line  of  slope  Iona  line  of  slope  1.  The  only  way  to 
do  so  is  to  rotate  the  space  and  time  axes  both,  and  both  by  the  same  amount,  as  is  done  in  figure  d.  Here 
time  and  space  have  both  changed:  the  stationary  (black,  (x,  f))  and  moving  (blue,  (x\  t'))  observer  measure 
different  values  for  the  time  and  space  coordinates,  as  given  by  the  Lorentz  transformations  (  ).  To  find 

the  angle  a  over  which  the  axes  must  rotate,  consider  a  stationary  object  in  S':  its  worldline  must  coincide 
with  the  ct'  axis,  while  it  moves  with  velocity  u  in  frame  S,  so  the  slope  of  the  ct'  axis  in  S  must  be  chi,  and 
we  have  tan(a)  =  u/c. 

A  subtle  but  important  point  is  that  the  Lorentz  transformations  do  not  only  change  the  orientation  of 
the  time  and  space  axis,  but  also  their  units  (which  is  what  you  might  have  expected,  as  you  know  about  time 
dilation  and  length  contraction  already).  To  see  how  this  works,  consider  the  point  (x',  ct')  =  (0, 1)  in  S'  (so  a 
unit  distance  away  from  the  origin  on  the  time  axis).  The  Lorentz  transformations  give  us  the  coordinates  of 
this  point  in  S:  (x,  ct)  =  (j[u){ul  c),j(u))  —  y(u)((u/c),  1).  The  length  of  this  interval  in  S'  is  1;  the  length  of  the 
same  interval  in  S  is  j{u)  \/l  +  (u/c)2.  We  conclude  that  the  units  of  ct'  and  ct  are  related  through  this  factor. 
A  completely  analogous  calculation  shows  that  the  same  holds  for  the  units  of  x!  and  x  (as  again  we  might 
have  guessed,  due  to  the  symmetry  of  time  and  space  now  present  in  our  system),  so  we  have: 


ct'  unit 
c  t  unit 


1  +  {u/c)2  x' unit 
1  -(u/c)2  xunit 


(12.1) 
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12.  Spacetime  diagrams 


Figure  12.1:  Spacetime  or  Minkowski  diagrams.  Horizontal  axis  depicts  a  spatial  coordinate,  vertical  axis  time  (times  c).  Note  that  this 
choice  of  axes  is  opposite  to  the  ordinary  choice  in  classical  mechanics,  (a)  Object  (orange  line)  moving  in  space  and  time  at  speed 
v  =  ( X2  -  x\)l{t2  -  t\).  A  stationary  object  would  trace  out  a  vertical  line.  Time  and  space  coordinates  of  a  certain  point  are  determined 
by  projection  on  their  respective  axes,  along  a  (dashed)  line  parallel  to  the  other  axis,  (b)  Galilean  coordinate  transformation  to  the 
comoving  frame  (x’,  ct’).  Note  that  it  is  the  time  axis  that  turns  -  consequendy,  both  the  start  and  end  points  of  the  trajectory  of  the 
moving  object  are  mapped  to  the  same  coordinate  in  x’  (so  the  object  is  stationary  in  the  comoving  frame).  Because  the  x-axis  has  not 
changed,  the  time  coordinates  in  the  comoving  frame  are  the  same  as  in  the  stationary  one.  (c)  Trajectory  of  a  light  beam  in  a  stationary 
frame.  The  light  beam  has  speed  c,  which  means  slope  1  because  we  multiplied  t  by  c.  Any  massive  object  moving  in  spacetime  has  a 
velocity  less  than  c  and  so  a  slope  of  more  than  1,  and  lies  closer  to  the  time  axis.  To  have  a  trajectory  with  slope  less  than  1,  the  object 
would  have  to  move  faster  than  light.  Note  that  objects  can  be  present  in  this  part  of  the  diagram,  but  can't  have  traveled  there  from  the 
origin,  (d)  Lorentz  transformation  to  the  frame  (x’,  ct’) .  To  keep  the  speed  of  light  constant,  the  axes  have  to  rotate  by  the  same  angle, 
illustrating  the  symmetry  between  space  and  time.  Under  a  Lorentz  transformation,  both  space  and  time  coordinates  of  any  point  that 
is  not  the  origin  change,  as  illustrated  -  coordinates  are  still  determined  by  projection  on  the  respective  axis,  parallel  to  the  other  axis. 


12.1.  Time  dilation  and  space  contraction  revisited 

As  we’ve  already  seen  twice,  Lorentz  transformations  do  funny  things  with  the  measurement  of  time  and 
space.  Here,  we’ll  study  these  effects  once  more  using  spacetime  diagrams.  First  we’ll  consider  time.  Suppose 
you  and  a  friend  come  together  at  some  point  in  space  and  time,  which  we’ll  call  O  (for  origin,  obviously).  You 
synchronize  your  (perfect)  watches.  Your  friend  next  takes  off  at  speed  u  in  the  x-direction,  while  you  remain 
stationary.  Your  inertial  frame  is  thus  S  (the  black  frame  in  figure  2.2a),  whereas  your  friend’s  inertial  frame 
is  S'  (blue  frame  in  figure  12.2).  After  some  time  t\  has  passed  on  your  watch,  you  whip  out  a  telescope  and 
take  a  look  at  your  friend’s  watch  -  and  see  that  it  is  lagging  behind  yours. 

To  see  why  your  friend’s  watch  appears  to  be  slow,  consider  the  observation  lines  in  figure  12. i  a.  You  both 
started  at  the  origin.  You  have  not  moved  in  space  since,  only  in  time,  so  your  position  in  spacetime  coincides 
with  the  c  t  axis.  Your  friend  is  moving  in  both  time  and  space,  so  in  your  coordinates  your  friend’s  trajectory  is 
a  sloped  line,  but  of  course  in  the  comoving  coordinates  of  S'  your  friend  is  also  stationary,  so  (s)he  is  moving 
on  the  ct'  axis.  When  you  look  at  your  friend  at  t  -  t\ ,  you  observe  him/her  at  t’  =  t[  in  S',  as  shown  by  the 
black  arrow.  From  your  point  of  view,  t\  and  t[  are  simultaneous,  because  they  lie  on  a  line  parallel  to  your 
space  axis.  However,  from  your  friend’s  point  of  view,  these  events  are  not  simultaneous  at  all!  Instead,  in 
S',  t\  is  simultaneous  with  events  on  the  line  parallel  to  the  xl  axis  -  so  point  to  (which  you  already  passed) 
on  your  time  axis.  You  can  find  to  by  projecting  from  rj  along  the  x'  axis  on  the  ct  axis  -  the  green  arrow  in 
figure  2  a.  By  moving  away  from  you,  your  friend’s  time  thus  seems  to  move  slower,  a  phenomenon  we  call 
time  dilation.  Of  course,  we  can  also  find  out  at  which  point  your  friend’s  watch  indicates  t\ ,  by  backtracking 
the  upper  green  arrow  in  figure  2.2a,  which  shows  that  that  time  corresponds  to  your  later  time  t2. 

We  now  have  two  ways  to  calculate  how  much  time  seems  to  slow  down  for  your  moving  friend.  On  the 
one  hand,  the  spacetime  diagrams  in  figure  !.2  simply  represent  Lorentz  transformations.  Since  you  haven’t 
moved  in  space,  x=0,  and  equation  gives  ct'  -  j{u)ct,  so  yin)  is  the  dilation  factor.  Alternatively,  we 
could  use  that  in  S,  t\  and  t[  are  observed  to  be  simultaneous.  We  already  know  that  the  slope  of  the  ct'  line 
in  S  is  given  by  tan(a)  =  ulc,  where  a  is  the  angle  between  the  ct  and  ct '  axes.  We  also  know  how  the  units  on 


1  Well,  not  really  -  it  takes  time  for  the  light  from  your  friends  watch  to  reach  you.  We’ll  assume  you're  smart,  and  have  corrected  for  this 
already. 


12.2.  An  invariant  measure  of  length 
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Figure  12.2:  Spacetime  diagrams  showing  (a)  time  dilation  and  (b)  length  contraction  of  a  moving  object  (blue  frame,  S ')  as  observed 
from  a  stationary  point  of  view  (black  frame,  S). 


the  ct'  axis  relate  to  those  in  S  (equation  12.1 ).  Putting  all  these  together,  we  have: 


t\  =  cos  (a)  •  (unit  conversion  factor)  •  t\ 


\A  +  (u/c) 


1  /l  +  (n  let  2 


1 


(12.2) 


A  similar  thing  happens  for  lengths.  Of  course,  in  our  new  four- dimensional  spacetime,  we  would  expect 
so,  as  time  and  space  are  now  intimately  linked.  Suppose  you  have  an  object  of  length  x\ ,  that  you  position 
on  your  x-axis  at  t  =  0,  with  one  end  at  the  origin  and  one  end  at  x  =  xi  (figure  b).  Now  consider  an 
identical  moving  object  that  passes  you  by  at  speed  u  at  t  -  0.  How  long  do  you  measure  that  object  to  be? 
One  end  is  at  the  origin  of  the  S'  frame,  which  coincides  with  the  origin  of  your  frame;  the  other  end  sits  at 
x'  =  x\  on  the  x'-axis,  which  you  find  by  looking  along  your  own  time  axis  (black  vertical  arrow).  To  project 
back  to  your  axis,  you  have  to  project  from  x\  on  your  space  axis  along  the  time  axis  of  S'  -  that’s  the  green 
arrow,  which  projects  x!x  on  some  point  xq  <  x\.  From  equation  (  12)  we  find  x\  =  y(u)X| ,  so  the  length  of 

the  object  appears  shortened  by  a  factor  y(u),  or  Lorentz  contracted. 

Note  that  time  dilation  and  length  contraction  are  really  the  same  thing  -  parts  a  and  b  of  figure  12.2  are 
identical,  only  reflected  in  the  diagonal  line  with  slope  1  that  represents  the  path  taken  by  light.  It  is  therefore 
also  not  surprising  that  the  contraction/ dilation  factor  y(zr)  is  the  same  in  both  cases. 

A  subtle  but  important  point  is  that  time  dilation  and  length  contraction  are  observations  you  make  on 
an  object  moving  relative  to  you.  As  long  as  you’re  in  an  inertial  reference  frame,  you  can  always  define  your 
own  frame  to  be  the  stationary  one  -  simply  pick  the  one  that  is  moving  with  you.  However,  your  friend  can  of 
course  do  exactly  the  same.  You  see  your  friend  moving  along  your  positive  x-axis  at  speed  u  -  and  your  friend 
sees  you  moving  along  their  positive  x-axis  at  speed  u,  the  only  difference  being  that  they  define  ‘positive’  the 
opposite  direction  you  do  (but  as  this  is  entirely  arbitrary,  there  is  no  ‘right’  or  ‘wrong’  choice).  Therefore, 
your  friend  also  sees  your  watch  go  slower,  and  your  lengths  contracted '! 


12.2.  An  invariant  measure  of  length 


As  we’ve  seen  in  the  previous  section,  measures  of  length  and  time  intervals  become  relative  in  the  new  real¬ 
ity  that  is  created  by  the  light  postulate.  That’s  not  very  convenient:  we’d  like  to  be  able  to  agree  on  the  value 
of  quantities  independent  of  which  inertial  reference  frame  we  use  to  measure  them  in.  In  ordinary  space, 
the  method  we  use  to  write  down  vectors  presents  us  with  a  similar  problem:  we  write  our  vector  compo¬ 
nents  with  respect  to  a  basis  or  coordinate  system,  and  so  choosing  a  different  set  of  coordinates  changes 
the  numerical  value  of  the  components  of  the  vector.  Of  course,  the  actual  vector  doesn’t  change  -  a  physical 
quantity  like  a  velocity  or  a  force  really  doesn’t  care  what  specific  coordinates  you  use  to  measure  it.  However, 
if  you  and  your  collaborator  use  different  coordinates,  you  might  run  into  misunderstandings  when  compar¬ 
ing  notes.  Fortunately,  vectors  have  properties  whose  value  are  independent  of  the  coordinate  system  used 
to  determine  their  components.  The  most  important  one  is  their  magnitude,  which  we  sometimes  also  refer 
to  as  their  length.  If  you  know  a  vector’s  components,  calculating  its  magnitude  is  easy:  its  simply  the  square 
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Hermann  Minkowski  (1864-1909)  was  a  German  mathematician,  who 
used  geometrical  methods  to  address  problems  both  in  number  theory 
and  in  (mathematical)  physics.  Born  in  Russia,  Minkowski’s  Jewish  fam¬ 
ily  emigrated  to  Prussia  (in  present-day  Germany)  to  avoid  persecution. 
He  worked  at  various  universities  in  the  German-speaking  world,  includ¬ 
ing  ETH  in  Zurich,  where  he  was  one  of  Einstein’s  teachers.  In  1907, 
Minkowski  realized  that  the  theory  of  relativity  as  introduced  by  Einstein 
two  years  earlier  could  most  easily  be  understood  in  a  four- dimensional 
spacetime  in  which  lengths  can  be  calculated  with  equation  (12.  ) .  He  also 
developed  the  graphical  representation  named  after  him  (figure  12.1).  In 
September  1908,  Minkowski  gave  a  lecture  titled  ‘Space  and  time’  in  which 
he  laid  out  these  ideas,  starting  with  a  now  famous  quote:  “The  views  of 
space  and  time  which  I  wish  to  lay  before  you  have  sprung  from  the  soil 
of  experimental  physics,  and  therein  lies  their  strength.  They  are  radical. 
Henceforth  space  by  itself,  and  time  by  itself,  are  doomed  to  fade  away 
into  mere  shadows,  and  only  a  kind  of  union  of  the  two  will  preserve  an 
independent  reality.”  Einstein  built  on  Minkowski’s  geometrical  represen¬ 
tation  to  extend  the  special  theory  of  relativity  to  the  general  one  in  the 
1910's.  Sadly,  Minkowski  did  not  live  to  see  this,  as  he  died  of  appendicitis 
at  age  44. 


Figure  12.3:  Elermann  Minkowski 
in  1909  [  ]. 


root  of  the  dot  product  with  itself: 

x  =  |x|  =  y/x-x.  (12.3) 

Similarly,  although  measures  of  both  length  and  time  change  between  inertial  reference  frame  in  relativity 
theory,  there  is  a  combination  of  the  two  whose  value  is  independent  of  which  reference  frame  you  measure 
it  in: 

(As)2  =  (cAf)2  -  (Ajc)  •  (Ax)  =  (cAt)2  -  (Ax)2  -  (Ay)2  -  (Az)2.  (12.4) 

Again,  we  see  that  time  and  space  are  mixed!  Checking  that  the  ‘length’  As  is  indeed  invariant  when  you 
go  from  one  inertial  frame  to  another  (i.e.,  when  you  apply  a  Lorentz  transformation)  is  a  straightforward 
exercise.  You  may  wonder  where  the  idea  for  equation  ( 12.4)  came  from,  and  the  answer  is  the  same  as 
always  in  relativity:  from  the  light  postulate.  What  both  you  and  your  collaborator  will  agree  on  is  the  speed 
of  light,  which  (as  it  is  a  constant) ,  you  can  measure  in  your  own  coordinate  system  as  c  =  (A  r)  /  (A  t) ,  where 
Ar  =  \J (Ax)2  -  (Ay)2  -  (Az)2  is  the  distance  covered  by  a  light  beam,  and  A  t  the  time  it  took.  Rewriting, 
you  get  0  =  (cAf)2  -  (Ax)2  -  (Ay)2  -  (Az)2,  which  won’t  depend  on  the  coordinate  system  you  used  -  so  your 
collaborator  gets  the  same  answer  . 

In  contrast  to  the  length  of  an  ordinary  vector,  the  length  of  the  invariant  interval  {As)2  can  become  zero 
or  even  negative!  As  we’ve  seen  above,  the  interval  is  always  zero  for  the  path  taken  by  a  light  beam  (or 
anything  traveling  at  the  speed  of  light).  If  you  travel  at  speeds  slower  than  light  (including  standing  still), 
the  length  of  your  interval  will  come  out  positive.  A  negative  length  would  correspond  to  a  trajectory  that 
has  a  speed  faster  than  that  of  light,  which,  as  we’ve  seen  in  section  10.1,  is  impossible.  There  can  therefore 
be  no  communication  between  any  two  points  in  spacetime  that  are  separated  by  an  interval  with  negative 
length.  In  other  words,  they  cannot  be  causally  connected :  it  is  impossible  for  an  event  at  one  such  location 
to  influence  the  other3.  We  call  events  separated  by  an  interval  with  negative  length  spacelike  connected; 
those  connected  by  a  positive  interval  are  timelike  connected  (and  can  be  causally  linked),  and  if  the  interval 
is  zero,  the  connection  is  called  lightlike.  Lightlike  events  can  still  be  causally  linked,  as  information  can  be 
send  from  one  to  another  using  a  light  signal.  The  timelike,  lightlike,  and  spacelike  regions  correspond  to  the 
v  <  c,  v  —  c  and  v  >  c  regions  in  figure  c.  Together,  the  lightlike  and  timelike  regions  make  up  the  light 
cone. 

2  In  fact,  there  is  an  alternative  derivation  of  the  Lorentz  transformations,  that  starts  from  the  observation  that  due  to  the  light  postulate, 
we  must  have  (As)2  =  0  for  a  light  beam.  Additionally,  space  must  be  homogeneous  (the  same  at  every  point)  and  isotropic  (looking  the 
same  in  every  direction).  Together  with  the  statement  that  the  transformations  have  to  be  linear  due  to  the  principle  of  relativity,  you 
can  determine  the  components  of  the  transformation  matrix  from  these  observations  as  well. 

3  Note  that  we  are  talking  about  events  in  spacetime  here:  at  a  specific  point  in  space  and  time.  You  can  of  course  send  out  a  signal  to  a 
specific  location,  but  if  that  point  is  connected  to  the  present  you  through  a  spacelike  interval,  you  cannot  influence  it  (or  be  influenced 
by  it)  at  that  moment,  you  can  only  influence  its  future.  A  teacher  lecturing  a  class  can  thus  only  influence  the  future  students,  and  the 
students  in  turn  can  only  influence  the  future  teacher,  which  is  perhaps  one  reason  why  special  relativity  sometimes  leads  to  confusion. 
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Figure  12.4:  Spacetime  diagrams  showing  the  three  events  of  the  worked  example  in  section  12.2.  Events  1,  2  and  3  are  indicated  with 
black  dots;  the  light  cone  of  point  1  is  shown  in  blue,  that  of  point  3  in  red.  We  can  immediately  read  off  that  point  2  is  inside  the  light 
cone  of  point  1  (so  1  &  2  are  timelike  connected),  point  2  is  on  the  edge  of  the  light  cone  of  point  3  (so  2  &  3  are  timelike  connected)  and 
point  3  is  outside  the  lightcone  of  point  1  (so  1  &  3  are  spacelike  connected). 


12.2.1.  Worked  example:  Causal  connections 

Consider  three  events,  occurring  at  spacetime  coordinates  (1)  (cr,  x)  =  (1,2),  (2)  [ct,  x)  =  (5,4),  and  (3)  {ct,  x)  = 
(3,6).  Which  of  these  events  can  be  causally  connected? 

Solution 

Method  1 :  We  calculate  the  length  of  the  invariant  interval  (12.4)  for  all  three  pairs: 

(1  &  2):  (As)2  =  (cAf)2  -  (Ax)2  =  (1  -5)2  -  (2  —  4)2  =  12  >  0  so  this  interval  is  timelike. 

:  (1  &  3):  (As)2  =  (cAf)2  -  (Ax)2  =  (1  -  3)2  -  (2  -  6)2  =  -12  <  0  so  this  interval  is  spacelike. 

(2  &  3):  (As)2  =  (cAf)2  -  (Ax)2  =  (5  -  3)2  -  (4  -  6)2  =  0  so  this  interval  is  lightlike. 

We  conclude  that  events  (1  &  2)  and  events  (2  &  3)  can  be  causally  connected,  but  events  (1  &  3)  cannot. 

Method  2:  We  draw  a  spacetime  diagram  with  the  three  events,  and  their  (past  and  future)  light  cones,  see 
figure  .  We  can  immediately  read  off  that  events  (1  &  2)  and  events  (2  &  3)  can  be  causally  connected  (as 
they  are  in  or  on  each  other’s  light  cone),  but  events  (1  &  3)  cannot,  as  they  are  outside  each  other’s  light  cone. 

12.2.2.  The  invariant  interval  and  the  ordering  of  events 

We’ve  already  seen  in  section  that  moving  and  stationary  observers  will  only  agree  on  whether  two  things 

happen  simultaneously  if  they  happen  at  the  same  point  in  space  and  time.  Suppose  now  that  we  have  two 
events  that  according  to  a  stationary  observer  happen  in  a  certain  order.  Would  a  moving  observer  at  least 
agree  to  that  ordering?  The  answer  is,  in  general,  no:  if  the  two  events  are  separated  by  a  spacelike  interval, 
there  is  a  finite  speed  (less  than  c)  at  which  an  observer  has  to  move  for  the  events  to  occur  at  the  same  time 
on  their  clock  -  and  if  they  move  faster,  the  ordering  is  reversed!  Calculating  this  speed  is  straightforward:  if 
the  separation  intervals  in  the  stationary  observer’s  frame  are  cAf  and  Ax,  we  simply  find  the  velocity  u  for 
which  the  Lorentz  transformed  time  equals  cAt'  =  j{u){cAt-  {ulc) Ax)  =  0,  so 

u-{cAt/Ax)c.  (12.5) 

If  the  separation  between  the  two  events  is  lightlike,  equation  (12.5)  tells  us  that  we’d  have  to  move  at  the 
speed  of  light  for  them  to  occur  at  equal  times  at  our  clock.  For  timelike  separations,  we’d  need  to  move  even 
faster  than  light  -  so  then  we’re  forced  to  agree  on  the  temporal  order  of  events.  However,  once  again  space 
and  time  show  their  dual  nature:  for  a  timelike  separation,  there  may  not  be  a  speed  at  which  the  events 
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Figure  12.5:  Spacetime  diagrams  showing  hyperbolas  of  invariant  (As)2  in  both  the  stationary  (black  frame,  S)  and  moving  (blue  frame, 
S')  coordinates.  By  finding  the  points  where  a  hyperbola  intersects  the  time  or  space  axis  of  the  moving  frame,  we  can  graphically 
construct  the  units  in  that  frame,  given  the  units  in  the  stationary  frame.  Note  that  for  any  value  of  (As)2,  the  hyperbola  has  the  light 
cone,  with  (As)2  =  0,  as  its  asymptote. 


occur  at  the  same  time  for  a  moving  observer,  but  there  is  one  for  which  they  occur  at  the  same  location. 
Unsurprisingly,  that  speed  is  simply  Ax/At  -  this  specific  result  holds  in  classical  mechanics  as  well  . 

12.2.3.  Units  in  spacetime  diagrams  revisited 

We  can  use  the  invariance  of  the  length  of  the  interval  (As)2  to  graphically  construct  the  units  of  time  and 
space  in  a  spacetime  diagram  that  has  undergone  a  Lorentz  transformation,  see  figure  12.5.  For  fixed  value 
of  (As)2,  the  line  given  by  equation  (  )  becomes  a  hyperbola  in  a  Minkowski  diagram.  For  (As)2  =  0,  we 

get  the  straight  line  with  a  45°  angle  describing  the  path  of  a  ray  of  light  (dashed  black  line  in  figure  12.5). 
For  positive  values  of  (As)2,  we  get  hyperbolas  that  intersect  the  ct  axis  at  cAf  =  As,  while  asymptotically 
approaching  the  (As)2  =  0  line  for  large  values  of  Ax.  These  lines  thus  also  intersect  the  ct'  axis,  at  which 
point  cAf'  =  As.  Likewise,  for  negative  values  of  (As)2,  we  can  map  points  on  the  x  axis  to  points  on  the  x' 
axis. 


12.3.  Worldlines  and  proper  time 


The  trajectory  followed  by  a  particle  through  space  and  time  is  commonly  called  a  worldline  in  the  theory 
of  relativity.  In  general,  a  worldline  does  not  have  to  be  a  straight  line  in  a  spacetime  diagram  -  you  can  of 
course  speed  up  and  slow  down  (relative  to  a  stationary  observer)  as  you  please.  Any  clock  you  bring  with  you 
will,  from  your  point  of  view,  keep  on  ticking  at  the  same  rate  as  always.  For  the  stationary  observer  its  rate 
will  however  depend  on  your  speed,  and  if  you  change  direction  or  accelerate,  that  will  have  an  effect  as  well. 
We  call  the  time  recorded  by  a  clock  in  the  comoving  reference  frame  of  a  particle  the  particle’s  proper  time, 
usually  denoted  by  r.  We  can  calculate  the  proper  time  with  respect  to  a  stationary  observer  by  chopping  up 
the  particle’s  trajectory  into  small  pieces.  In  each  of  the  pieces,  the  particle’s  velocity  will  be  constant,  and 
we  can  calculate  the  time  dilation  as  we  did  before,  then  we’ll  integrate  over  the  pieces  to  get  the  total  proper 
time.  To  see  how  this  is  done,  consider  a  part  of  the  trajectory  between  t  -  t\  and  t  ~  ti+l,  in  which  the  particle 
travels  from  Xj  to  Xj+i  as  measured  by  a  stationary  observer.  That  observer  then  calculates  the  particles  speed 
in  this  interval  to  be 


Xj+i  -  Xj 
ti+ 1  —  tf 


(12.6) 


4  As  you  can  verify  next  time  you’re  on  a  train:  from  your  stationary  point  of  view,  the  platforms  you  encounter  will  all  pass  through  the 
same  point  next  to  you. 
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We  now  define  the  interval  of  proper  time  as  the  length  of  the  invariant  interval  As ,  corresponding  to  the 
displacement  divided  by  the  speed  of  light,  which  gives: 


At,  =  —  =  -\/ c2(f,-+i  -  tj)2  -  (x,-+i  -  x,-)2 
c  c 


,  (Xi+ 1  -  Xi)2 

(fi+1  -  ti)\l  1  -  37^ - 1—2  =  A ti\l  1  - 


C2(h+|  -  h) 


A  tj 
Y(Vi) ' 


(12.7) 


At,-  is  the  time  interval  as  measured  on  a  comoving  clock.  Unsurprisingly,  it  is  related  to  A  f;  through  the  time 
dilation  factor  y(n,).  To  calculate  the  total  time  which  has  passed  on  the  comoving  clock,  we  simply  sum  over 
all  discrete  intervals  i.  In  the  limit  where  the  length  of  the  interval  becomes  infinitesimally  short,  that  sum 
becomes  an  integral,  and  we  can  calculate  the  proper  time  by: 


At  = 


(12.8) 


where  [Ta,  f|, |  is  the  time  interval  as  measured  by  the  stationary  observer.  Note  that  equation  (  2.8)  holds  for 
any  kind  of  motion,  not  just  motion  at  constant  velocity. 
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12.4.  Problems 

12.1  In  the  (c  t,  x)  coordinates  of  an  inertial  frame  S  three  events  occur  at  Gi  =  ( 1 , 2) ,  G2  =  (5, 4)  and  G3  =  (3,6). 
Which  of  these  events  could  be  causally  linked? 

12.2  Two  astronauts  (we’ll  call  them  A  and  B)  leave  Earth  on  January  1st  in  opposite  directions.  Astronaut  A 
initially  travels  as  a  speed  4c/5,  and  B  with  3c/5.  The  astronauts  keep  traveling  until  they  measure  their 
distance  to  Earth  at  5  light  years,  at  which  point  they  turn  around  and  travel  back.  On  the  way  home, 
astronaut  A  travels  at  3c/5,  while  B  travels  at  4c/5. 

(a)  Determine  how  far  each  astronaut  has  traveled  in  Earth’s  frame  of  reference  before  (s)he  returns. 

(b)  Draw  a  spacetime  diagram  showing  the  trips  of  the  two  astronauts.  NB:  You  may  want  to  take  your 
space  for  this,  it  will  help  see  details  clearly. 

(c)  Which  astronaut  turns  around  first? 

(d)  Find  the  length  of  the  spacetime  interval  between  the  events  ‘astronaut  A  reaches  his  turning  point’ 
and  ‘astronaut  B  reaches  her  turning  point’,  as  measured  by  each  of  the  astronauts,  and  by  an  ob¬ 
server  back  on  Earth,  and  verify  that  the  spacetime  interval  is  indeed  invariant. 

(e)  Which  astronaut  returns  to  Earth  first? 

(f )  As  measured  by  astronaut  A,  what  is  the  total  distance  that  astronaut  B  travels? 

(g)  Before  they  left  Earth,  the  astronauts  promised  to  send  each  other  a  message  by  radio  every  new 
year.  How  many  such  messages  does  each  astronaut  send? 

(h)  Draw  the  lines  representing  the  radio  signals  of  each  astronaut  in  the  spacetime  diagram. 

(i)  For  each  astronaut,  determine  how  many  signals  they  sent  on  the  way  out,  way  back,  and  when 
already  back  on  Earth. 

(j)  After  both  astronauts  have  returned  to  Earth,  which  of  them  has  aged  more? 

12.3  You  observe  a  spaceship  with  astronauts  A  and  B  (fill  in  your  own  favorite  SciFi  characters  if  you  like) 
in  it,  moving  at  constant  velocity  u  >  0  along  the  x-axis  of  your  rest  frame  S.  At  time  t  =  to,  when  the 
spaceship  is  at  ( xg,yo,zo ),  astronaut  B  gets  on  board  a  small  shuttle  and  leaves  with  constant  velocity 
v  >  u  in  the  x-direction  to  a  distant  star,  which  is  stationary  in  your  rest  frame  S.  At  t  —  to,  you  measure 
the  distance  between  the  spaceship  and  distant  star  to  be  d.  Astronaut  B  reaches  the  star  at  t  —  t\  on  your 
clock,  does  whatever  (s)he  wanted  to  do  quickly,  then  returns  to  the  mother  ship  with  the  same  speed  v, 
reaching  it  at  t  =  t.2  on  your  clock.  The  mother  ship  meanwhile  kept  the  same  speed  u  throughout. 

(a)  Draw  a  spacetime  diagram  with  the  worldlines  of  astronauts  A  and  B.  Indicate  all  relevant  quantities, 
and  specify  the  slopes  of  any  lines  you  draw. 

(b)  Express  t2  and  X2  (the  place  where  the  shuttle  and  mother  ship  meet)  in  terms  of  the  quantities  x0, 
to,  u,  v,  and  d. 

(c)  Both  astronauts  carry  high-precision  watches  which  they  synchronize  when  B  leaves  the  mother 
ship.  Express  the  times  given  by  both  watches  upon  return  of  B  in  terms  of  the  given  quantities. 

(d)  Which  astronaut  has  aged  more  between  the  departure  and  return  of  B? 

12.4  Having  negotiated  a  cease-fire,  an  imperial  and  a  rebel  spaceship  leave  their  meeting  place  with  syn¬ 
chronized  clocks  (we’ll  set  the  meeting  place  at  x  —  0,  and  the  time  of  leaving  at  t  =  0),  departing  with 
relative  velocity  (i.e,  the  velocity  at  which  they  each  measure  the  other  ship  to  be  moving)  of  (3/5)c  in 
opposite  directions.  Once  he  has  thought  about  it  for  a  day,  the  imperial  captain  comes  to  the  conclu¬ 
sion  that  the  emperor  will  reward  him  higher  for  eliminating  the  rebels  than  making  peace  with  them, 
so  he  fires  a  photon  torpedo  at  the  rebel  ship.  Likewise,  having  pondered  the  option  of  peace  for  a  day, 
the  rebel  captain  decides  that  offense  is  the  best  defense,  and  fires  a  photon  torpedo  at  the  imperial 
ship.  Both  ships  are  equipped  with  launchers  that  can  fire  such  torpedos  with  speed  (4/5)  c  with  respect 
to  the  ship. 

(a)  Draw  a  spacetime  diagram  with  the  two  spaceships  and  the  two  photon  torpedos  as  seen  from  the 
imperial  ship.  Use  units  of  c- (half  a  day)  on  the  time  axis,  and  half  a  light-day  on  the  space  axis.  Pro¬ 
vide  the  calculation  for  each  value  that  you  have  to  compute.  NB:  make  your  drawing  large  enough 
for  details  to  be  properly  visible  and  use  a  ruler;  don’t  worry  about  the  paper,  we  promise  to  recycle 
it  in  due  time. 
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(b)  When,  as  measured  on  the  clock  of  the  imperial  ship,  does  the  torpedo  fired  by  them  hit  the  rebel 
ship? 

(c)  When,  as  measured  on  the  clock  of  the  imperial  ship,  does  the  torpedo  fired  by  the  rebels  hit  them? 

(d)  Fortunately  for  the  rebel  ship,  their  shield  is  quite  strong,  being  able  to  withstand  an  impact  with  a 
momentum  up  to  3.0  •  1011  kg  •  m/s.  The  torpedo  fired  by  the  imperial  ship  has  a  mass  of  1000  kg. 
Does  the  rebel  ship  survive  the  impact? 

(e)  Both  ships  have  a  home  base  1  light  week  away  from  the  meeting  point.  When  the  rebels  at  the 
home  base  see  the  imperial  ship  fire  on  their  friends,  they  decide  to  mount  a  counterattack.  They 
launch  another  ship  at  v  —  (3/5)c,  which  immediately  after  launch  also  fires  a  torpedo  with  (4/5) c 
relative  to  the  ship.  Does  that  torpedo  hit  the  imperial  ship  before  it  returns  home?  To  answer  this 
question,  you  may  assume  that  the  imperial  and  rebel  ship  left  the  meeting  point  with  equal  speeds, 
as  measured  from  the  rest  frame  of  the  bases.  You  may  also  find  drawing  another  spacetime  diagram 
useful. 

12.5  A  train  with  proper  length  L  moves  at  speed  4c/ 5  with  respect  to  the  ground.  A  passenger  lets  a  toy  train 

run  from  the  back  of  the  (moving)  train  to  the  front,  with  speed  c/3  as  measured  on  the  large  train. 

(a)  Draw  a  spacetime  diagram  with  the  worldlines  of  the  front  of  the  large  train,  the  back  of  the  large 
train,  and  the  small  train,  from  the  point  of  view  of  a  (stationary)  observer  on  the  ground. 

How  much  time  does  it  take  the  toy  train  to  cross  the  train,  and  what  distance  does  the  toy  train  cover, 

in: 

(b)  The  frame  of  the  large  train? 

(c)  The  frame  of  the  toy  train? 

(d)  The  frame  of  a  stationary  observer  on  the  ground? 

(e)  Verify  that  the  length  of  the  invariant  interval  (equation  12  A)  is  the  same  in  all  three  frames. 
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Position,  energy  and  momentum  in 

SPECIAL  RELATIVITY 


13.1.  The  position  four-vector 

As  we’ve  seen  in  the  previous  section,  we  can  define  a  ‘length’  As  that  is  invariant  under  Lorentz  transforma¬ 
tions  (equation  ): 

(As)2  =  (cAt)2  -  (Ax)  •  (Ax)  =  (cAf)2  -  (Ax)2  -  (Ay)2  -  (Az)2. 

As  is  clear  from  the  definition  of  As,  to  get  invariant  quantities,  we  should  not  think  of  space  as  measured 
in  three  dimensions,  but  of  spacetime,  measured  in  four  dimensions.  This  four- dimensional  world  of  special 
relativity  is  called  Minkowski  space,  and  its  vectors  have  four  components:  one  for  time  and  three  for  space. 
Conventionally,  we  add  the  time  component  as  the  zeroth  component  of  the  vector.  To  distinguish  between 
‘ordinary’,  three-dimensional  vectors  (which  are  represented  in  bold)  and  four-vectors,  we’ll  put  a  line  on  top 
of  the  latter.  The  position  four-vector  is  then  given  by: 

x=  (xo,  Xi,  X2,  X3)  =  (ct,x,y,z).  (13.1) 

We  would  like  to  be  able  to  determine  the  length  of  the  position  four-vector  by  taking  the  inner  product  of 
the  vector  with  itself.  However,  the  regular  inner  product  is  not  going  to  work,  because  instead  of  ( ct )2  +  x2  + 
y2  +  z2,  the  quantity  that  is  independent  of  the  reference  frame  is  (cf)2  -  x2  -  y2  -  z2.  We  therefore  define  the 
inner  product  of  two  four- vectors  a  and  b  as 

a- b  =  aobo  -  a\bi  -  <22^2  ~  ^3^3-  (13.2) 

As  we’ve  already  seen,  the  magnitude  of  the  position  four-vectors,  as  determined  by  its  inner  product  with  it¬ 
self  (equation  (  ))  is  independent  of  the  inertial  reference  frame  you  use  to  measure  it  in.  In  problem  3.3 

you’ll  show  that  consequently,  the  value  of  the  inner  product  of  any  two  four-vectors  is  reference  frame  inde¬ 
pendent. 

It  may  seem  that  we’re  back  to  normal  -  we’ve  added  a  dimension,  and  introduced  a  new  inner  product, 
but  with  those,  we  should  be  able  to  do  calculations  just  as  easily  as  in  ordinary  3D  space.  The  last  part  is  true, 
but  the  new  inner  product  is  actually  different  from  the  regular  one  in  one  very  important  respect:  the  value 
in  equation  (12.  )  can  be  zero  or  even  negative  for  nonzero  four-vectors  x!  To  see  what’s  going  on,  we  return 
to  the  spacetime  diagram,  in  particular  figure  12.:  c.  Suppose  we  have  a  particle  traveling  in  the  x-direction 
(taking  y  —  z  =  0  for  convenience) .  What  speed  does  it  need  for  the  length  of  its  four- vector  to  vanish?  For  that 
to  happen  we  need  ct-x,  or  v  -  xl  t  -  c,  so  x  •  x  becomes  zero  for  something  traveling  at  the  speed  of  light. 
Likewise,  x  •  x  is  positive  for  a  particle  traveling  slower  than  light,  and  negative  for  a  particle  traveling  faster 
than  light  (which  is  of  course  impossible,  since  such  a  particle  would  need  to  first  reach  the  speed  of  light, 
which  as  we’ve  seen  can  never  be  done).  However,  we  can  consider  the  four-vector  x  between  any  two  points 
in  spacetime,  and  from  the  sign  of  x  •  x  tell  whether  they  can  be  connected  trough  regular  (slower  than  light) 
travel,  by  a  light  beam,  or  not  at  all.  The  first  we  call  timelike,  the  second  lightlike,  and  the  third  spacelike : 

x  •  x  >  0  timelike 

xx=0  lightlike  (13.3) 

x  •  x  <  0  spacelike 


135 


136 


13.  Position,  energy  and  momentum  in  special  relativity 


Two  events  which  are  connected  by  a  spacelike  four-vector  cannot  influence  each  other:  there  is  no  way  to 
send  a  signal  between  them,  and  therefore  there  is  no  way  to  transfer  information. 


13.2.  Lorentz  transformation  matrix  and  metric  tensor* 

In  this  section  we’ve  joined  space  and  time  in  a  single  four-vector,  and  defined  a  new  inner  product  on  the 
space  of  those  four-vectors.  In  chapter  we  defined  the  Lorentz  transformations  of  the  space  and  time 
coordinates,  which  are  linear  transformations.  Linear  transformations  can  of  course  be  represented  by  ma¬ 
trices,  and  for  our  four-vectors,  we  can  write  down  the  appropriate  Lorentz  transformation  matrix,  rewriting 
equation  (  1.12)  as  a  vector  equation: 

x'  =  Lx.  (13.4) 

Here  L  is  a  4  x  4  matrix: 

7  («)  -7(K)f  0 

T_  -7(k)  f  7  (u)  0 

0  0  1 

,  0  0  0 

Like  with  the  four- vectors,  we  start  labeling  the  rows  and  columns  of  L  with  index  0.  To  indicate  the  difference 
with  matrices  in  regular  space,  it  is  conventional  to  indicate  indices  of  regular-space  vectors  and  matrices 
with  Roman  letters  (like  V{  forthe  /th  component  of  vector  v,  and  Aij  forthe  (throw,  /th  column  ofmatrix  A), 
and  those  of  Minkowski-space  vectors  and  matrices  with  Greek  letters  -  so  we  write  xfi  for  the  pth  component 
of  the  four-vector  x,  where  p  can  be  0, 1,  2,  or  3. 

We  can  also  write  equation  (13.'  )  in  index  form: 

3 

4  =  E  Llx^  (13.6) 

v=0 


0  1 
0 
0 
1 


(13.5) 


The  kind  of  summation  in  equation  (  )  is  very  common  -  so  common,  that  most  people  adopt  the  sum¬ 

mation  convention:  you  always  sum  over  repeated  indices  -  from  1  to  3  if  they’re  Roman,  from  0  to  3  if  they’re 
Greek  -  and  leave  out  the  explicit  summation  sign.  Equation  (  )  then  simply  reads  x'^  =  L^xv.  Some  au¬ 

thors  add  the  strict  condition  that  you  only  sum  over  repeated  indices  if  one  is  down  and  the  other  up  (like  in 
equation  (13.6),  where  the  up  index  of  Ljt  and  the  down  index  of  xv  are  summed  over),  whereas  others  sum 
over  everything  that’s  repeated,  so  you  should  always  be  explicit  in  which  form  of  the  convention  you  follow. 
Just  to  be  clear:  equation  (13.6)  is  simply  the  expression  in  elements  of  the  multiplication  of  the  matrix  given 
in  equation  (  )  with  the  four-vector  x,  so  you  could  also  write  it  as 


(  x'  \ 

"  J{u) 

-7(u)f 

0 

0  1 

'  x0  1 

x' 

- 7(« )f 

7(“) 

0 

0 

*i 

4 

0 

0 

1 

0 

*2 

4  J 

l  0 

0 

0 

1  > 

,  *3 

(13.7) 


The  Lorentz  transform  can  be  thought  of  as  a  map  (in  the  mathematical  sense”)  from  Minkowski  space  to 
itself:  it  takes  a  four-vector  and  maps  it  on  another  four-vector.  The  other  operation  we’ve  encountered,  the 
inner  product  of  two  four- vectors,  is  also  a  map,  but  its  properties  are  different:  it  takes  as  its  argument  two 
four-vectors,  and  the  result  is  a  real  number.  Moreover,  as  with  all  inner  products,  the  map  is  linear  in  both 
its  arguments  (we  call  such  maps  bilinear).  A  bilinear  map  from  a  vector  space  to  a  real  number  is  a  (rank) 
two-tensor3 .  The  tensor  that  gives  the  inner  product  of  two  position  vectors  is  known  as  the  metric  tensor, 
because  this  inner  product  is  related  to  the  vectors’  length.  Any  two-tensor  can  be  represented  as  a  matrix; 
the  metric  tensor  g1' v  of  Minkowski  space  is  given  by: 


r  1  0  0  0  1 

0-100 
0  0-10 
,0  0  0  -1  , 


(13.8) 


1In  one  of  Einstein’s  most  quoted  statements,  he  jokingly  called  the  summation  convention  'his  most  important  contribution  to  science’. 

2  If  you’ve  never  heard  of  maps  before:  you’re  actually  familiar  with  them,  as  any  mathematical  function  is  a  map.  For  instance,  the 
function  f{x,  y,  z )  =  \j  xA  +  y2  +  z2,  which  gives  you  the  norm  of  the  vector  x  =  (x,  y,  z),  is  a  map  from  R3  to  R. 

3  A  vector  is  a  (rank)  one-tensor;  a  (rank)  n-tensor  maps  n  vectors  on  R,  linear  in  each  of  its  arguments. 
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and  we  can  use  it  to  calculate  the  inner  product  as  (using  the  summation  convention  on  both  indices): 

a  b  =  g^v afjbv  =  agbo  -  a\b\  -  ^2^2  ~  <23^3-  (13.9) 

The  metric  tensor  may  not  seem  very  useful  at  this  point  -  after  all,  we  don’t  really  need  it  to  calculate  the 
inner  product,  as  long  as  we  remember  the  definition  (13.2).  It  will  start  to  matter  in  the  theory  of  general 
relativity,  in  which  spacetime  can  be  curved  by  gravity.  In  that  case  the  components  of  the  metric  tensor 
change,  and  inner  products  of  position  four-vectors  become  complicated  expressions. 

13.3.  Velocity  and  momentum  four-vectors 

We  already  encountered  the  position  four-vector  in  section  .  The  four  components  of  this  four-vectors 
represent  a  ‘position’  in  the  four- dimensional  spacetime  of  special  relativity:  three  coordinates  in  space,  one 
in  time.  We  defined  a  dot  product  for  these  four-vectors  in  equation  (13.1 ),  and  an  associated  length  of  a  four 
vector  (the  square  root  of  the  dot  product  with  itself).  Positions  are  however  not  the  only  vector  quantities 
we  know  -  we  encountered  many  others  in  part  ,  like  velocity  and  momentum.  Like  position,  these  vector 
quantities  have  four-vector  versions  in  special  relativity.  Also,  like  the  position  four- vector,  we’ll  demand  that 
their  length  should  be  invariant  under  Lorentz  transformations. 

Let  us  start  with  velocity.  In  classical  mechanics,  we  calculate  velocity  as  the  time  derivative  of  the  posi¬ 
tion:  v  =  dxl  df.  It  is  a  straightforward  exercise  to  check  that  the  time  derivative  of  the  position  four- vector  x 
is  not  a  four- vector  itself,  as  its  length  is  not  invariant  under  Lorentz  transformations.  Perhaps  this  does  not 
surprise  you:  after  all,  t  is  the  time  as  measured  by  an  outside  observer,  and  we’ve  argued  extensively  that  time 
depends  on  the  observer.  Instead  of  taking  the  derivative  to  t,  we  should  therefore  take  the  derivative  with 
respect  to  time  on  a  comoving  clock,  i.e.,  the  proper  time  r,  in  which  case  we  do  get  the  velocity  four-vector: 

dir 

v=  —  =7(U(c,  vx,  vY,  vz),  (13.10) 

dr 

where  vx,  vy  and  vz  are  the  classical  velocity  components,  and  v  is  the  speed.  A  straightforward  calculation 
shows  that  the  length  of  v  equals  c,  which  is  certainly  invariant  under  Lorentz  transformations. 

Momentum  in  classical  mechanics  is  defined  as  mass  times  velocity.  We  use  the  same  definition  in  special 
relativity,  with  the  only  difference  that  the  velocity  is  now  given  by  the  velocity  four- vector.  Because  the  mass 
is  just  a  scalar,  it  simply  multiplies  the  velocity  four-vector,  and  is  not  affected  by  Lorentz  transformations,  so 
this  procedure  again  yields  a  proper  four- vector  with  length  me: 

p=mv-  my{v){c,vx,vy,  vz)=  ( y{v)mc,pXtpy,pz ),  (13.11) 

where  px  -  y{v)mvx,  py  -  y(v)mvx  and  pz  -  y{v)mvx  are  the  components  of  the  ‘three-momentum’  p  - 
{px,  py,  pz).  Note  that  unlike  the  velocities,  these  components  are  not  equal  to  their  classical  counterparts  , 
as  they  differ  by  a  factor  y(v). 

13.4.  Relativistic  energy 

The  last  three  (or  ‘spatial’)  components  of  the  momentum  four-vector  give  us  the  regular  components  of  the 
momentum,  times  the  factor  y(v).  What  about  the  zeroth  (or  ‘temporal’)  component?  To  interpret  it,  we 
expand  y(v),  and  find: 

c>  8  Vc 

The  second  term  in  this  expansion  should  be  familiar:  it’s  the  kinetic  energy  of  the  particle.  The  third  and 
higher  terms  are  corrections  to  the  classical  kinetic  energy  -  just  like  the  higher-order  terms  in  the  spatial 
components  are  corrections  to  the  classical  momenta.  The  first  term  however  is  new:  an  extra  energy  contri¬ 
bution  due  to  the  mass  of  the  particle.  The  whole  term  can  now  be  interpreted  as  the  relativistic  energy  of  the 
particle: 

E  —  y(v)mc2  —  me2  +  K.  (13.13) 

4This  difference  is  unfortunate,  as  it  might  be  confusing.  We  keep  the  classical  components  of  the  velocity,  as  j(v)  is  expressed  in  these 
classical  values.  For  the  momenta  however,  we'll  frequently  use  the  relativistic  version,  so  it  is  more  convenient  to  define  px,  Py  and  pz 
as  components  of  p. 


epo  -  y(v)mc  - 


1  +  - 
2 


s( 


+ ... 


2  2  1  2 

me  -  me  +  -mv  + . 
2 


(13.12) 
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We  can  now  write  the  zeroth  component  of  the  momentum  four- vector  as  po  -  El  c.  Based  on  this  interpre¬ 
tation,  the  four-vector  is  sometimes  referred  to  as  the  energy-momentum  four-vector. 

A  very  useful  relation  can  now  easily  be  derived  by  calculating  the  length  of  the  energy-momentum  four- 
vector  in  two  ways.  On  the  one  hand,  it’s  given  by  (leaving  out  the  square  root  for  convenience) 

p  ■  p  -  m2v  ■  v  -  m2c2,  (13.14) 


while  on  the  other  hand,  we  could  also  simply  expand  in  the  components  of  p  itself  to  get: 

P'P  =  (f)  -P-P- 


where  p  is  again  the  spatial  part  of  p.  Combining  equations  (13.1  )  and  (13.15),  we  get: 


(13.15) 


r-i2  2  4,  2  2 

E  -  m  c  +  p  c  , 


(13.16) 


where  p2  -  p  ■  p.  Equation  (  16)  is  the  general  form  of  Einstein’s  famous  formula  E  =  me2,  to  which  it 

reduces  for  stationary  particles  (i.e.,  when  v  -  p-  0). 


13.5.  Conservation  of  energy  and  momentum 

In  classical  mechanics,  energy  and  momentum  were  separate  entities,  each  obeying  its  own  conservation 
law.  In  special  relativity,  they  are  two  parts  of  the  same  quantity  (the  energy- momentum  four- vector),  just 
like  time  and  space  are  two  parts  of  the  same  position  four-vector.  Consequently,  energy  and  momentum 
have  to  obey  the  same  rules  in  special  relativity.  Fortunately,  a  conservation  law  on  a  vector  quantity  applies 
to  each  of  its  components,  and  so  conservation  of  energy  and  momentum  translates  to  conservation  of  the 
energy-momentum  four-vector  p.  However,  unlike  in  classical  mechanics,  mass  is  no  longer  conserved:  since 
it  is  now  interpreted  as  a  part  of  the  total  energy  of  a  system  (equation  .13),  it  can  be  converted  into  or 
created  from  kinetic  energy.  The  equivalence  of  mass  and  energy  has  important  consequences  for  collision 
experiments,  including  a  whole  new  type  of  ‘collisions’:  radioactive  decay  of  matter. 

You  might  complain  that  we  haven’t  actually  proved  that  the  energy-momentum  four- vector  is  conserved 
in  special  relativity  (and  you  would  be  right).  What  we  have  done  is  define  the  relativistic  energy  E  =  y(  u)  me2 
and  three-momentum  p  =  j(v)rnv,  as  well  as  the  energy-momentum  four- vector  p.  We  have  also  shown 
that  with  these  definitions,  p  is  a  proper  four-vector,  in  the  sense  that  it  is  invariant  under  Lorentz  trans¬ 
formations.  Therefore,  we  know  that  if  it  is  conserved  in  one  inertial  frame,  it  must  also  be  conserved  in 
all  others.  We  also  know  that  our  relativistic  energy  and  momentum  revert  to  the  classical  kinetic  energy 
(plus  a  constant,  me2)  and  the  classical  momentum  mv  at  low  velocities.  The  conservation  laws  for  these 
classical  quantities  follow  from  Newton’s  second  and  third  laws  of  motion,  respectively.  In  special  relativity, 
we  no  longer  take  these  laws  as  our  axioms,  only  retaining  Newton’s  first  law  of  motion  in  inertial  reference 
frame.  We  therefore  cannot  prove  conservation  of  the  energy-momentum  four-vector  mathematically,  and 
must  take  it  as  an  axiom.  As  I’ve  just  argued,  this  axiom  is  consistent  with  the  laws  of  classical  mechanics 
in  the  low-velocity  limit.  It  is  also  consistent  with  experimental  data  -  which,  like  Einstein’s  postulates  and 
Newton’s  laws  in  classical  mechanics,  is  the  ultimate  test  of  our  physical  model. 
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13.6.  Problems 

13.1  In  high-energy  physics,  it  is  customary  to  express  the  mass  of  elementary  particles  not  in  kilograms  but 
in  MeV/c2,  expressing  the  fact  that  (rest)  mass  is  a  form  of  energy.  An  MeV  or  mega- electron- Volt  is 
one  million  (the  ‘mega’)  times  the  (kinetic)  energy  an  electron  gains  when  it  moves  through  an  electric 
field  between  two  positions  with  an  electric  potential  difference  of  1  volt,  or  1  joule  per  coulomb.  One 
electron-volt  thus  corresponds  to  an  amount  of  energy  (in  joules)  equal  in  number  to  the  charge  of  the 
electron  in  coulombs.  Express  the  mass  of  both  the  electron  and  the  proton  in  MeV/ c2;  you  may  find 
the  numbers  in  table  useful. 

13.2  For  an  arbitrary  particle  of  (rest)  mass  m,  find  the  speed  at  which  its  kinetic  energy  equals  its  rest  energy. 

13.3  We  constructed  four  vectors  in  such  a  way  that  their  length  is  invariant  under  Lorentz  transformations. 
The  length  of  a  four-vector  is  defined  as  the  square  root  of  its  dot  product  with  itself:  |J6[  =  vx  ■  x  = 
Xq  -  xf  -  x2  -  In  equation  (13.; )  we  also  defined  the  dot  product  of  two  arbitrary  four-vectors  a  and 

b. 

(a)  Show  that  the  sum  of  two  four-vectors  is  again  a  four-vector  (i.e.,  show  that  the  length  is  invari¬ 
ant  under  Lorentz  transformations,  and  the  components  transform  the  same  way  that  those  of  the 
position  four-vector  do). 

(b)  Calculate  the  square  of  the  length  of  the  four- vectors  a  +  b  and  a-b. 

(c)  Use  your  answer  at  (b)  to  write  the  dot  product  of  a  and  &  as  a  linear  combination  of  quantities  that 
are  invariant  under  Lorentz  transformations  (thus  showing  that  the  dot  product  is  also  invariant). 

13.4  A  particle  with  mass  m  has  three-momentum  p  as  measured  in  an  inertial  lab  frame  S.  Find  the  parti¬ 
cle’s  energy  as  measured  by  an  observer  with  three-velocity  u.  Hint.  Determine  the  four- vectors  of  the 
particle’s  momentum  and  the  observer’s  motion  both  in  the  lab  frame  S  and  the  observer’s  rest  frame  S’, 
then  use  the  fact  that  inner  products  of  four-vectors  are  invariant  under  Lorentz  transformations. 
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Relativistic  collisions 


In  supercolliders  such  as  the  ones  at  CERN  in  Geneva  and  (formerly)  Fermilab  in  Chicago,  small  particles  like 
electrons  and  protons  are  accelerated  to  speeds  near  that  of  light,  then  made  to  collide  with  each  other  in  an 
attempt  to  create  exotic  types  of  matter  (i.e.,  non-common  particles).  The  very  reason  why  this  can  be  done  is 
the  relation  between  energy,  mass  and  momentum  given  by  the  general  version  of  Einstein’s  famous  equation, 
equation  (13.16).  This  equation  tells  us  that  if  the  incoming  particles  have  sufficiently  high  kinetic  energy,  we 
can  create  new  particles  with  more  mass  than  the  originals  had.  The  process  by  which  this  happens  is  the 
realm  of  quantum  field  theory,  but  the  mechanics  of  the  collisions  can  be  studied  within  special  relativity. 

fust  like  in  classical  mechanics,  we  can  define  a  totally  inelastic  collision  as  any  collision  in  which  the  par¬ 
ticles  stick  together.  We  define  a  totally  elastic  collision  as  a  collision  in  which  the  momentum,  kinetic  energy, 
and  mass  of  all  particles  are  conserved.  We’ll  have  one  more  type,  that  has  no  classical  counterpart:  radioac¬ 
tive  decay,  in  which  a  particle  falls  apart  into  multiple  particles  -  a  sort  of  time-reversed  inelastic  collision.  All 
cases  can  be  analyzed  using  conservation  of  energy-momentum.  Although  that  basic  concept  is  in  principle 
sufficient,  there  are  many  cases  for  which  writing  out  the  components  of  the  energy-momentum  four  vector 
as  four  equations  is  not  the  easiest  way  to  find  (say)  the  energies  or  momenta  of  the  outgoing  particles.  There 
are  some  other  tricks  that  you  can  use  -  in  particular  the  invariance  of  the  length  of  the  energy-momentum 
four  vector,  both  in  a  collision  process  and  under  a  Lorentz  transformation.  A  few  examples  will  help  to  illus¬ 
trate  this  point. 


14.1.  Photons 

Before  we  dive  into  the  examples,  there’s  one  particle  that  requires  special  attention:  the  photon,  or  quantum 
of  light  -  from  that  other  early-20th-century  theory  known  as  quantum  mechanics.  Photons  travel  (by  defini¬ 
tion)  at  the  speed  of  light,  and  need  therefore  be  massless.  They  do  carry  energy  though,  which  is  related  to 
their  frequency  /  (or  wavelength  A,  or  color)  through 

he 

-Ephoton  =  hf  —  ~T>  (14.1) 

A 


where  h  is  Planck’s  constant.  Since  photons  have  nonzero  energy,  they  also  have  nonzero  momentum  through 
Einstein’s  equation  (13.16),  despite  the  fact  that  they  have  no  mass 


Ephoton 

Pphoton  = 


hf  =  h 
c  A 


(14.2) 


A  photon  with  frequency  /  (and  thus  energy  E  =  hf )  traveling  in  the  positive  x  direction  has  a  very  simple 
energy-momentum  four  vector: 

Pphoton  =  (E7  c,  El  c,  0, 0).  (14.3) 

The  length  of  this  four- vector,  unsurprisingly,  is  zero. 

1  Note  that  relativistic  momentum  is  given  by  p  =  y(v)mv,  substituting  v  =  c  gives  y(c)  =  oo,  and  so  this  expression  gives  us  the  product 
of  infinity  with  zero  for  the  momentum  of  the  photon  -  that  could  be  anything,  and  thus  is  meaningless.  The  way  to  calculate  the 
momentum  of  the  photon  is  through  equation  (14.2).  Although  the  photon  momentum  is  small,  it  is  large  enough  to  be  measured,  and 
even  useful  in  devices  known  as  optical  tweezers,  where  focused  laser  beams  are  used  to  move  micron-sized  objects  around. 
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14.  Relativistic  collisions 


14.2.  rOTALLY  INELASTIC  COLLISION 

In  a  totally  inelastic  collision,  particles  stick  together.  A  possible  example  is  the  absorption  of  a  photon  by 
a  massive  particle,  resulting  in  an  increase  in  its  mass,  as  well  as  possibly  a  change  in  its  momentum.  Let’s 
consider,  as  an  example,  a  particle  of  mass  m  that  is  initially  at  rest,  and  absorbs  an  incoming  photon  with 
energy  Ey.  There  are  now  three  ways  to  calculate  the  energy  and  momentum  of  the  particle  after  this  collision. 

Method  1 :  We  have  conservation  of  both  (total)  energy  and  momentum.  Before  the  collision,  the  massive 
particle  has  energy  E\  =  me 2  (as  it  is  standing  still),  and  the  total  energy  of  the  system  is  Ey  +  me2,  which  must 
be  conserved.  The  total  energy  of  the  particle  after  the  collision  is  Ef  —  j(_v)mfC2,  where  both  the  velocity  u 
and  the  mass  m f  are  unknown.  The  total  momentum  before  the  collision  is  Eylc,  as  the  particle  is  initially 
standing  still  (and  thus  has  momentum  zero),  while  after  the  collision  it  is  j{v)mfV.  We  thus  have: 


Ey  +  me2  -  j[v)mfC2, 

(14.4) 

Ey  —  j{v)mfvc. 

(14.5) 

We  thus  have  two  equations  with  two  unknowns  ( v  and  mf).  If  we  divide  equation  (  1.5)  by  (  ),  we  get  an 

expression  for  the  final  velocity  v,  which  we  can  substitute  back  in  either  equation  to  solve  for  mf  (and  poten¬ 
tially  use  to  calculate  the  momentum  after  the  collision).  This  is  not  pretty  though,  as  we’ll  have  complicated 
factors  due  to  the  presence  of  y(t'). 

Method  2:  The  four-momentum  of  the  system  is  conserved  during  the  collision.  We  have 

Pr  =  \ 

for  the  photon, 

(14.6) 

Pi  =  i 

[me,  0,0, 0) 

for  the  massive  particle  before  the  collision, 

(14.7) 

Pf  = 

(f,H 

for  that  particle  after  the  collision. 

(14.8) 

From  pr  +  pi  —  f>{  we  can  read  off  two  equations: 

Ey  +  me2  =  Ef,  (14.9) 

Eylc- p^  (14.10) 

which  immediately  give  us  the  final  energy  and  momentum  in  terms  of  the  initial  ones.  We  can  now  find  the 
final  mass  through  Einstein’s  equation  (13.16): 

m2 c4  -  E2  -  p2c2  =  ( Ey  +  me2)2  -  E2  —  ( Ey  +  mc2)mc2.  (14.11) 

This  approach  circumvents  the  use  of  the  y(v)  factor,  because  we  only  use  energy  and  momentum,  not  (clas¬ 
sical)  velocity.  If  we  now  want  the  velocity,  we  could  still  calculate  it  from  the  combination  of  m\  and  either  Ef 
or  p{,  but  since  it  were  the  mass  and  momentum  we  were  after,  there’s  no  need  to  do  so. 

Method  3:  Since  the  total  energy-momentum  four  vector  is  conserved  in  the  collision,  so  must  be  its  length 
(or  the  square  of  the  length),  which  is  trivial  to  calculate  (remember  that  p-  p  =  m2c2).  We  can  often  exploit 
this  fact  to  make  the  maths  much  simpler.  To  see  how  this  works,  let’s  consider  the  full  four-vector  equation 
for  our  example:  pr  +  pi-  pf,  so 


{py+  Pi)  ■  (pY  +  P\)  =  Pf  pi  (14.12) 

Py  ■  py  +  Pi  pi  +  2 py  ■  pi  =  pf  ■  pf  (14.13) 

0  +  m2c2  +  2Erm  —  m2  c2,  (14.14) 

which  immediately  gives  us  mf.  If  we  also  want  Ef  or  pf,  we  can  again  use  equations  (  1.9)  and  (  )  for  the 

components,  but  if  we  only  wanted  the  final  mass,  we’re  done  in  one  step. 

Note  that  although  method  3  usually  is  the  easiest  route  to  your  answer,  it  isn’t  always  -  and  it  is  a  good 
idea  to  at  least  be  aware  of  the  other  options. 


14.3.  Radioactive  decay  and  the  center-of-momentum  frame 
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14.3.  Radioactive  decay  and  the  center-of-momentum  frame 

Radioactive  decay  is  the  process  by  which  unstable  particles  with  high  mass  fall  apart  into  more  stable  parti¬ 
cles  with  lower  mass.  Although  the  process  itself  is  quantum  mechanical  in  nature,  the  dynamics  of  radioac¬ 
tive  decay  are  described  by  special  relativity,  and  are  essentially  identical  to  those  of  an  inelastic  collision  in 
reverse. 

Decay  may  occur  spontaneously  (as  a  random  process),  but  can  also  be  stimulated,  by  the  absorption  of 
a  (typically  small,  e.g.  a  photon  or  electron)  particle  by  the  unstable  one  -  a  process  used  in  nuclear  reactors. 
Because  the  absorbed  particle  also  carries  energy,  in  stimulated  decay  the  masses  of  the  resulting  particles 
can  add  up  to  something  more  than  the  rest  mass  of  the  original  particle.  An  important  question  in  nuclear 
physics  is  what  the  threshold  energy  of  a  given  reaction  is,  i.e.,  the  minimum  energy  the  incoming  particle 
must  have  for  the  process  to  be  possible.  This  is  not  simply  the  differences  in  the  mass  energy  of  the  original 
and  the  resulting  particles,  as  in  the  collision  process  momentum  must  also  be  conserved.  To  illustrate  how 
to  approach  such  a  problem,  let’s  again  consider  a  concrete  example:  the  threshold  energy  for  the  reaction  in 
which  a  proton  (mp  =  938  MeV/c2),  initially  at  rest,  absorbs  a  photon,  and  then  emits  a  neutral  pion  [mn  - 
135  MeV/ c2),  see  figure  a. 
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Figure  14.1:  Example  of  stimulated  radioactive  decay:  a  proton,  initially  at  rest,  absorbs  a  photon,  and  then  emits  a  neutral  pion.  The 
reaction  is  shown  in  the  lab  frame  in  (a),  and  in  the  center-of-momentum  frame  in  (b). 


Figuring  out  what  the  minimal  required  energy  is  in  the  lab  frame  is  not  easy,  as  you  have  to  account  for 
the  kinetic  energy  of  the  particles  after  the  reaction.  There  is  however  a  system  in  which  the  reaction  prod¬ 
ucts  are  standing  still:  the  center-of-momentum  frame,  the  relativistic  analog  of  the  center-of-mass  frame  of 
classical  mechanics  .  The  center-of-momentum  frame  is  defined  as  the  frame  in  which  the  total  momentum 
of  all  particles  adds  up  to  zero.  In  our  specific  example,  before  the  collision,  only  the  photon  carries  a  mo¬ 
mentum,  equal  to  its  energy  Er  divided  by  the  speed  of  light.  In  general,  the  total  momentum  in  the  system 
can  be  a  three-vector,  equal  to  p\  =  X;  Pi,  while  the  total  energy  is  given  by  Ex  =  L /  A  ■  If  we  choose  our  coor¬ 
dinates  such  that  the  x-direction  coincides  with  that  of  pj,  the  energy- momentum  four-vector  of  the  entire 
system  becomes  pj  =  (Ejlc,pj,  0,0),  where  p\  -  \pj\.  If  we  go  to  any  different  inertial  frame  S'movingwith 
velocity  v  in  the  positive  x  direction,  the  components  of  the  energy-momentum  four  vector  are  given  by  the 
Lorentz  transform  of  pj\ 


Py  =  J{V) 


v  v  Ej 

PT  y  PT - , 

C  C  C 


(14.15) 


2  As  our  system  includes  a  photon,  a  center-of-mass  frame  doesn’t  make  sense  here,  as  the  photon  has  no  mass  -  but  it  has  nonzero 
momentum,  so  we  can  make  a  transformation  to  a  system  in  which  the  total  momentum  vanishes. 
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so  we  end  up  in  a  frame  in  which  the  total  momentum  is  zero  if  we  pick 

c2  pi 

vcom  =  (14.16) 

bj 

for  our  velocity.  In  particular,  we  see  that  we  can  always  make  this  transformation,  and  that  the  center-of- 
momentum  frame  is  an  inertial  frame. 

Back  to  our  example:  why  do  we  care?  The  answer  is  almost  tautological:  if  the  total  momentum  is  zero 
before  the  collision,  it  is  also  zero  afterwards  -  and  so  in  the  COM  frame,  the  particles  can  all  be  standing  still 
(see  figure  b) .  That  certainly  corresponds  to  the  lowest  possible  kinetic  energy  of  the  system,  so  the  energy 
of  the  incoming  photon  is  all  converted  to  mass  -  and  that  must  thus  be  the  threshold  energy  we’re  looking 
for.  Interestingly,  to  answer  our  original  question,  we  don’t  even  need  to  calculate  what  the  actual  velocity 
of  the  COM  frame  is,  just  the  fact  that  it  exists  is  sufficient.  In  the  COM  frame,  we  have,  by  conservation  of 
four-momentum: 

Py  +  Pp,i  =  PpJ'  +  P'tc>  (14-17) 

and  therefore  also 

(j Py  +  Pp,i )2  =  iPpJ'  +  p'n )2  =  ( mpc+mnc )2,  (14.18) 

where  the  last  equality  follows  from  the  fact  that  the  reactants  are  standing  still.  Now  the  left-hand-side 
of  equation  (  .18)  is  the  length  of  a  four- vector,  and  we’ve  proven  that  these  lengths  are  invariant  under 

Lorentz  transformations  -  so  it’s  value  is  equal  to  that  of  ( pr  +  pp,i)2  in  the  lab  frame.  In  that  frame,  we  have 
Py  -  {Eyl c)  (1, 1,0,0)  and  pp>;  =  (mpc,  0,0,0),  so  we  end  up  with  an  easy  equation  form  Er: 

(rap  +  mn)2c2  =  p2  +  p2ti  +  2 p2  •  pPt i  =  0  +  m2c 2  +  2 £r rap,  (14.19) 


or 


Ey  — 


mi.  +  2mnm0  0 

— - c 2  =  145  MeV. 

2  m„ 


(14.20) 


In  this  example,  we  thus  need  at  least  10  MeV  of  energy  more  than  the  mass  of  the  particle  we’ve  created. 

Note  that  in  finding  the  threshold  energy  in  the  example,  we  again  heavily  relied  on  the  four-vector  prop¬ 
erties  of  p  -  not  only  it’s  length  (like  in  the  third  method  of  section  2),  but  also  the  invariance  of  that  length 
under  Lorentz  transformations.  Using  these  properties  results  in  easy  equations  to  solve,  while  if  you’d  ignore 
them,  you’d  probably  get  stuck  trying  to  figure  out  what  the  kinetic  energy  of  the  reaction  products  is. 


14.4.  Totally  elastic  collision:  Compton  scattering 

As  a  final  example  of  a  collision  in  special  relativity,  we  consider  the  totally  elastic  case:  a  collision  in  which 
the  total  momentum,  total  kinetic  energy,  and  the  mass  of  all  particles  are  conserved.  An  example  of  such 
a  collision  is  Compton  scattering:  the  collision  between  a  photon  and  an  electron,  resulting  in  a  transfer  of 
energy  from  one  to  the  other,  visible  in  a  change  of  wavelength  of  the  photon.  For  our  example,  we’ll  take  the 
electron  to  be  initially  stationary,  and  the  photon  to  be  coming  in  along  the  x-axis;  after  the  collision  both 
particles  have  nonzero  velocities  in  both  the  x  and  y  directions  (see  figure  4.2). 

before  after 


(Zrvy-v  © 


Figure  14.2:  Compton  scattering  between  a  photon  and  an  electron,  resulting  in  a  transfer  of  energy  of  the  photon  to  the  electron, 
measurable  as  a  change  in  the  photon’s  wavelength. 


14.4.  Totally  elastic  collision:  Compton  scattering 
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The  four-momenta  of  the  electron  and  photon  before  and  after  the  collision  are  given  by: 


r  mec  1 

f  1 

f  Ee:y/c  1 

(  1 

0 

E\ 

1 

pe,fCOS0 

Et 

cos  8 

Pe,  i  = 

0 

’  Pr'i_  c 

0 

-  Pe,  f  = 

Pe,fSin(/> 

1  u 
II 

a. 

-sind 

l  0  , 

l  0  , 

l  0  J 

l  0 

(14.21) 


We  can  now  solve  for  the  energy  H\  of  the  outgoing  photon  (and  thus  its  wavelength)  in  terms  of  that  of  the 
incoming  photon  (£;)  and  the  scattering  angle  8.  There  are  again  (at  least)  two  ways  to  do  this.  One  is  to 
compare  the  components  of  the  initial  and  final  energy-momentum  four-vector  term  by  term.  The  other  is 
to  again  use  the  fact  that  we  know  about  the  length  of  the  four- vector  to  immediately  eliminate  the  scattering 
angle  <p  of  the  electron.  To  do  so,  we  first  rewrite  the  conservation  of  energy- momentum  equation,  pe> ;  +  /5,,j  = 
pe  t  +  py  f  to  isolate  the  term  of  the  outgoing  electron,  and  then  take  the  square,  to  get: 


(Pe,i  +  Pr,i-pr,f)2  =  Pe,f 
P2e, i  +  Py, i  +  Py,f  +  Pe,i  ■  Py, i  -  2pe,i  •  Py, f  -  2pr,i  ■  py, f  =  p2eS 

E'  E  ■ 

mgC2  +  0  +  0  +  2me£’i-2me£'f-2— 1 ^-(1-cosd)  =  m2ec2, 

c *■ 

from  which  we  can  solve  for  Ey.  Rewriting  to  wavelengths  (through  E-hf-  he IX),  we  get 

h 

Af  =  A;  H - (1  -  cosd). 

mec 


(14.22) 

(14.23) 

(14.24) 


(14.25) 
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14.5.  Problems 

14. 1  A  photon  with  frequency  /  collides  with  a  stationary  atom  with  rest  mass  m.  In  the  collision,  the  photon 
is  absorbed  by  the  atom.  Determine  the  mass  and  speed  of  the  atom  after  the  collision. 

14.2  A  particle  with  mass  m  and  kinetic  energy  2 me2  collides  with  a  stationary  particle  with  mass  2m.  After 
the  collision,  the  two  particles  are  fused  into  a  single  particle.  Find  both  the  mass  and  the  speed  of  this 
new  particle. 

14.3  A  stationary  atomic  nucleus  undergoes  a  radioactive  process  known  as  /5-decay,  in  which  one  of  its 
neutrons  (with  rest  mass  mn  =  939.6  MeV)  falls  apart  into  a  proton  (which  remains  in  the  nucleus,  rest 
mass  mp  =  938.3  MeV),  an  electron  (rest  mass  me  -  0.5  MeV),  and  an  anti-neutrino.  Neutrino’s  are  very 
light  particles;  we’ll  take  the  emitted  neutrino  to  be  effectively  massless  and  thus  travel  at  the  speed  of 
light  with  momentum  pv.  The  nucleus  remains  stationary.  Find  the  momenta  pv  and  pe  of  the  emitted 
neutrino  and  electron,  as  well  as  the  speed  of  the  emitted  electron. 

14.4  A  proton  with  rest  mass  mp  and  momentum  pp  is  moving  in  the  positive  x-direction.  A  photon  with 
frequency  /  is  traveling  in  the  negative  x-direction,  and  collides  head-on  with  the  proton.  After  the 
collision,  both  proton  and  photon  are  traveling  in  the  positive  x-direction.  Show  that  the  frequency  f 
of  the  photon  after  the  collision  is  given  by 

Ep  +  epp 

Ep-cpp  +  2hfJ  ’ 

where  £p  is  the  energy  of  the  proton  before  the  collision. 

14.5  Particles  like  the  electrons  in  atomic  orbitals  can  be  in  a  low-energy  ground  state  (with  energy  Eq),  or,  by 
absorbing  a  photon,  be  put  in  a  higher-energy  excited  state  (with  energy  E[).  The  particle  can  return  to 
the  ground  state  by  emitting  another  photon.  Quantum  mechanics  tells  us  that  only  very  specific  states 
with  very  specific,  discrete  (or  ‘quantized’)  energies,  are  allowed. 

(a)  If  the  particle  is  initially  at  rest,  the  energy  of  an  incoming  photon  with  frequency  v  (and  energy 
E  =  hv)  has  to  be  slightly  larger  than  the  energy  difference  A E  —  E\  -  Eq  between  the  particle’s  ground 
and  exited  states  if  the  particle  is  to  absorb  the  photon.  Explain  why. 

(b)  Show  that  for  an  incoming  photon  that  is  absorbed  by  an  initially  stationary  particle,  we  have 

I  A  E\ 

l.r.  =  4£(1  +  — ). 


(c) 


Likewise,  show  that  for  an  initially  stationary  particle  in  the  excited  state  with  energy  E\,  the  energy 
of  the  emitted  photon  is  given  by 


hve  —  A  E 


(d)  Suppose  we  have  two  identical  atoms,  one  of  which  contains  an  electron  in  the  exited  state,  and  the 
other  only  electrons  in  the  ground  state.  The  atom  with  the  electron  in  the  exited  state  emits  a  pho¬ 
ton.  Is  there  a  possible  scenario  in  which  the  other  atom  absorbs  the  photon  (resonant  absorption)? 


14.6  Matter- antimatter  annihilation  and  creation  As  you  may  have  heard,  for  every  elementary  particle  of 
‘ordinary’  matter,  there  exists  an  antiparticle  of  ‘antimatter’,  which  shares  many  characteristics  with  its 
ordinary  counterpart  (such  as  the  mass),  whereas  others  are  opposite  (such  as  the  charge).  When  a 
particle  and  its  antiparticle  meet,  they  completely  annihilate,  converting  all  of  their  combined  mass 
into  pure  energy,  in  the  form  of  radiation  (i.e.  photons).  The  most  common  antiparticle  is  that  of  the 
electron,  which  is  known  as  the  positron.  First,  we  consider  an  experiment  in  which  an  ordinary  electron 
of  mass  me  with  momentum  pe  hits  a  positron  (mass  me)  at  rest,  at  which  point  the  two  annihilate, 
producing  two  photons. 


(a)  Argue  why  such  an  annihilation  must  produce  at  least  two  photons. 

(b)  One  of  the  two  produced  photons  emerges  at  an  angle  of  60°  to  the  direction  of  the  incident  electron. 
What  is  its  energy? 
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(c)  Find  the  angle  (with  the  direction  of  the  incident  electron)  at  which  the  other  photon  emerges. 

The  opposite  of  annihilation,  spontaneous  creation  of  matter,  can  also  happen:  then  a  photon  sponta¬ 
neously  converts  to  a  particle-antiparticle  pair. 

(d)  Why  must  the  photon  convert  to  a  particle-antiparticle  pair,  rather  than  simply  convert  to  a  single 
particle? 

(e)  Find  the  minimum  wavelength  a  photon  must  have  to  create  an  electron-positron  pair.  Where  is 
this  photon  in  the  electromagnetic  spectrum? 

14.7  Apion  (rest  mass  140  MeV/  c2)  with  a  momentum  of  25  MeV/c  decays  into  a  muon  (rest  mass  106MeV/c2) 
and  a  neutrino  (rest  mass  approximately  zero;  taken  zero  in  this  problem). 

(a)  Show  that 

pJl-Pv  =  -(ml  +  ml)c2, 

where  pn  and  /5/(  are  the  energy-momentum  four- vectors  of  the  pion  and  the  muon. 

(b)  Determine  the  kinetic  energy  of  the  outgoing  muon  for  the  case  that  it  travels  in  a  direction  that 
makes  a  90°  angle  with  that  of  the  incoming  pion. 

(c)  Again  for  the  case  that  the  direction  of  the  outgoing  muon  makes  a  90°  angle  with  that  of  the  incom¬ 
ing  pion,  find  the  direction  of  the  outgoing  neutrino. 

(d)  Finally,  again  for  the  same  case,  find  the  magnitude  of  the  (three-)  momentum  of  the  outgoing  neu¬ 
trino.  Express  your  answer  in  the  same  units  as  the  given  values  (MeV/c) 

14.8  A  neutral  kaon  (or  K  meson)  with  a  mass  of  498  MeV/c2  and  an  initial  velocity  of  c/2  decays  into  two 
pions  (one  with  a  positive  and  one  with  a  negative  charge),  each  of  which  has  a  mass  of  135  MeV/c2. 


before 
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b) 


after  (lab  frame) 
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© 


after  (lab  frame) 
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(a)  Find  the  speeds  and  the  angles  of  the  pions  in  the  lab  frame  if,  in  the  rest  frame  of  the  kaon,  they  are 
emitted  in  opposite  directions,  whose  line  makes  an  angle  of  90°  with  the  propagation  direction  of 
the  kaon? 

(b)  Answer  the  same  question  as  in  (a),  for  the  case  that  the  pions  are  emitted  one  in  the  same  and  one 
in  the  opposite  direction  as  the  kaon. 

(c)  Sometimes  a  kaon  decays  into  more  than  two  pions  (there  are  also  neutral  pions;  the  charges  of 
course  need  to  add  up  to  the  kaon  charge).  Determine  the  maximum  number  of  pions  that  our 
kaon  can  decay  into. 

(d)  Prove  that  in  any  situation,  the  trajectories  of  the  created  pions  are  in  one  plane.  Hint :  do  this  in  the 
kaon’s  rest  frame  first. 
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Relativistic  forces  and  waves 


15.1.  The  force  four-vector 


As  we’ve  discussed  in  chapter  above,  you  can  analyze  all  types  of  collisions  in  special  relativity  without 
ever  making  a  reference  to  the  forces  they  exert  on  each  other.  In  fact,  we  haven’t  talked  about  force  at  all 
so  far,  and  there’s  a  good  reason  for  that:  forces,  already  frequently  less  practical  than  energies  in  classical 
mechanics,  become  veritable  nightmares  in  special  relativity.  Nonetheless,  there  are  some  questions  you  can 
only  answer  with  reference  to  forces  -  for  example,  what  velocity  a  particle  will  get  if  you  exert  a  certain  force 
on  it  for  a  given  period  of  time. 

In  classical  mechanics,  Newton’s  second  law  relates  momenta  and  forces,  through  the  time  derivative 
of  the  momentum.  In  relativity,  we’ll  therefore  simply  define  the  force  four-vector  as  the  derivative  of  the 
energy-momentum  four  vector  with  respect  to  the  proper  time  (which  gives  a  four-vector,  as  you  can  check 
easily): 


dp 


1  d  E 


F-^-TM\-  —  ,Fx,Fy,F; 


c  dr 


(15.1) 


We  define  the  components  of  the  three-force  F  as  the  (‘regular’  or  ‘coordinate’)  time  derivative  of  the  three- 
momentum:  F  —  dpi  dr,  so  Newton’s  second  law  holds  as  long  as  you  don’t  change  your  frame  of  reference. 
Likewise,  Newton’s  third  law  holds,  if  you  consider  the  three-force  F  in  a  fixed  frame  of  reference.  The  zeroth 
term  of  F  contains  the  time  derivative  of  the  energy,  which  we  defined  as  the  power  in  section  3.1:  P  -  dE/ dt, 
again  within  the  context  of  a  fixed  frame  of  reference. 

There  is  a  classical  result  that  involves  the  force  that  does  translate  to  special  relativity  for  arbitrary  refer¬ 
ence  frames:  the  work-energy  theorem.  To  see  how  that  comes  about,  consider  a  Lorentz  transform  from  a 
comoving  system  (or  instantaneous  inertial  frames')  to  an  arbitrary  inertial  frame  S.  In  S',  j(u)  =  y(0)  =  1,  so 
the  space  components  of  the  force  four-vector  are  just  the  components  of  the  force  three-vector  (and  New¬ 
ton’s  second  law  holds);  moreover,  in  this  frame,  dE'  I  dt  =  d  [ml  \/ 1  -  (m')2/c2)/ dt  =  0,  because  the  derivative 
contains  a  factor  u' ,  which  (by  choice  of  frame)  is  zero.  We  thus  have  F'  —  (0,  F'x,  F'y,  F'z).  The  force  is  a  four- 
vector,  and  therefore  transforms  according  to  equation  (13.  ): 
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(15.2) 


so,  comparing  the  components  of  P  in  equations  (  )  and  (  .2),  we  get 


(15.3) 


The  longitudinal  force  is  thus  the  same  in  both  frames,  but  the  transversal  force  is  not!  Forces  thus  behave 
differently  than  you  might  naively  expect  under  Lorentz  transformations.  Moreover,  the  transformation  is  not 

1  Some  authors  use  /  to  avoid  confusion  with  the  four- force  F ;  others  use  F  for  the  three-force  and  K  for  the  four- force. 
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symmetric:  we  don’t  get  F'y  =  Fy/j{-  u)  (which  would  indeed  directly  contradict  equation  (  ).  The  reason 

why  we’ve  lost  this  symmetry  is  that  for  forces,  there  is  a  special  frame:  that  of  the  particle  (here  S'),  where 
Newton’s  second  law  holds.  In  all  other  frames,  we  have  to  transform  the  forces  according  to  equation  (15.:  ). 

There  is  a  silver  lining:  the  zero  component  in  equation  (  5.2)  gives  us  that  d E  =  uF'xdt  -  F'xdx  -  Fx  dx, 
which  integrated  gives  the  work-energy  theorem:  A E  =  F Ax  =  AM/.  As  long  as  we  stay  away  from  the  forces, 
work  and  energy  will  behave  as  we’ve  come  to  expect. 


15.2.  E  FOUR-ACCELERATION 

We  can  of  course  also  define  a  four-vector  version  of  the  acceleration,  by  taking  the  derivative  of  the  four- 
velocity  with  respect  to  the  proper  time.  As  with  the  forces,  we’ll  see  that  we’re  in  for  some  nasty  surprises,  this 
time  because  the  proper  time  derivative  acts  on  the  jin')  factor  in  the  velocity  as  well  as  on  the  components: 


a  = 


dv 

dr 


d  f  a  tt  ’  v  o  ,  u  *  v  a 

r(v)—Y(v)(.c,vx,vy,vz)=^Y  (0 - ,7  [v)a  +  j 


(15.4) 


where  we  used  the  time  derivative  of  the  y{  u)  function 


dy  d  1 

df  df  —  {vie)2 
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2  (l-(p/c)2)3/2 


v  diM  ,  va 


(15.5) 


and  we’ve  introduced  the  (classical)  acceleration  three-vector  as  the  coordinate  time  derivative  of  the  velocity 
three-vector:  a  —  dvl  dt.  As  you  can  see  in  equation  (  ),  the  four- acceleration  has  terms  that  scale  with  y2 

and  terms  that  scale  with  y4,  making  it  an  inconvenient  object  to  work  with.  Geometrically  though,  it  has  a 
clean  interpretation,  which  comes  into  view  once  you  consider  the  inner  product  between  the  acceleration 
and  velocity  four-vectors: 


d  ■  v  =  y5  ( v)  ( a  ■  v)  -  y3  ( v)  (a  ■  v)  -  y5  ( v) 

c2 


(v-v) 


=  y3(v)(a- v) 


1  -v2lc2 
1  -  V2  /  c2 


=  0. 


(15.6) 


These  four-vectors  are  therefore  (in  the  four-vector  sense)  always  perpendicular!  That  seems  odd  from  a 
classical  point  of  view:  if  you  move  in  the  x-direction,  and  speed  up,  both  velocity  and  acceleration  point 
in  the  same  direction  and  are  thus  certainly  not  perpendicular.  We  do  have  a  perpendicular  case  of  course: 
circular  motion  (with  a  velocity  along  the  circle,  and  acceleration  inwards).  Something  similar  happens  here, 
if  you  consider  the  world  line  of  a  particle  in  a  spacetime  diagram  (see  figure  ).  You  can  think  of  this  line 
as  a  curve  that’s  parametrized  by  the  proper  time  r;  points  on  the  curve  are  then  given  by  the  position  four- 
vector  at  time  r.  The  velocity  four-vector  is  the  normalized  tangent  to  this  line  (and  indeed,  by  construction, 
has  a  fixed  length  c).  When  you’re  moving  at  constant  velocity,  the  line  is  straight,  but  if  you  change  your 
velocity  (i.e.,  you  accelerate),  the  line  curves.  The  acceleration  four-vector  both  quantifies  that  curvature, 
and  points  in  the  direction  that  the  curve  is  bendings . 

Because  by  definition  p  -  mv,  and  F  =  dpi  dr,  as  long  as  the  mass  is  conserved  (d ml  dr  =  0),  we  do  have 
F  —  ma,  so  Newton’s  second  law  does  hold  for  four-vectors.  This  result  is  not  nearly  as  useful  as  in  classical 
mechanics  though,  since  as  we’ve  seen,  forces  transform  in  unwieldly  manners  between  inertial  frames,  and 
the  acceleration  can  only  curve  the  trajectory  in  spacetime. 

To  see  how  forces  and  accelerations  can  be  used  for  a  case  where  you  have  no  choice  but  to  use  them  , 
consider  a  particle  that  is  under  constant  acceleration,  due  to  a  constant  three-force  acting  on  it  in  the  (non- 
inertial!)  co-moving  frame  of  the  particle  (e.g.  due  to  a  rocket  engine  attached  to  the  particle).  From  the  point 
of  view  of  an  inertial  rest  frame,  we  have 


dp 

dt 


F  and 


(15.7) 


2There  is  a  one-on-one  correspondence  between  these  ‘world  curves’  and  affinely  parametrized  curves  in  real  space  of  two  or  more 
dimensions.  There  too,  you  can  define  a  tangent  vector  as  the  derivative  of  the  position  vector,  which  for  an  affinely  parametrized  curve 
is  always  of  unit  length.  The  derivative  of  the  tangent  vector,  known  as  the  normal,  is  always  perpendicular  to  the  tangent,  and  points 
in  the  direction  in  which  the  curve  is  bending;  its  magnitude  quantifies  the  curvature. 

3 I’m  sure  you’ve  noted  the  obvious  pun  here. 
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Figure  15.1:  Spacetime  diagram  showing  accelerated  motion.  The  position  four-vector  gives  the  collection  of  points  in  spacetime  that 
the  world  line  passes  through,  parametrized  by  the  proper  time  r.  The  velocity  four-vector  (red)  is  the  normalized  tangent  to  that  line, 
and  the  acceleration  four- vector  (green),  which  is  always  perpendicular  to  the  velocity  four- vector,  its  curvature. 


Choose  the  x-axis  to  be  along  the  direction  of  F,  and  define  a  -  ax  =  Fxlm.  Then 

d  ipxlm)  Awx 

a  — - = - , 

df  df 


(15.8) 


where  w  =  pi m  -  j(v) v,  and,  as  we  have  only  motion  in  the  positive  x-direction  here,  we  have  wx  =  w, 
vx  =  v.  Solving  equation  (  )  for  w,  we  get  the  velocity  of  a  uniformly  accelerated  particle:  w  ( t)  -  w(0)  +  at. 

Now  solving  for  the  actually  measured  velocity  in  the  inertial  frame  (taking  w{ 0)  =  0),  we  find 

->  o  •>  I  v2  \  at 

r{v[t))v(t)=  w(t)  =  at  =>  v  =ar  1 - y  =>  v  =  (15.9) 

\  c2)  \/l  +  a2t2lc2 


Figure  5.2  compares  the  relativistic  velocity  with  the  classical  result.  Unsurprisingly,  they  are  initially  iden¬ 
tical,  but  once  the  speed  starts  picking  up,  we  see  that  the  classical  results  starts  to  deviate  significantly.  In 
particular,  the  relativistic  result  confirms  that  no  matter  how  long  we  accelerate,  our  particle  will  never  exceed 
the  speed  of  light. 

On  a  side  note,  we  can  also  solve  for  the  actual  trajectory  of  our  particle:  simply  integrate  d xl  df  =  v(  t), 
which  gives 

c2 

x(f)  =  — 
a 

For  small  values  of  t ,  we  (again)  recover  the  classical  result,  x-  |  at 2 . 


2  f2 


1  + 
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(15.10) 


15.3.  Relativistic  waves 

We’ve  seen  that  in  special  relativity,  space  and  time  are  intimately  coupled.  There  is  a  classical  phenomenon 
for  which  this  is  also  the  case:  the  waves  we  discussed  in  chapter  .  In  section  we  introduced  the  sinusoidal 
wave,  described  by  (equation  9.  ): 

u(x,  f)  =  Acos(k-x- d)t).  (15.11) 

In  equation  (  )  we  made  the  wave  a  function  of  all  three  space  coordinates,  introducing  a  wave  vector 

k  rather  than  just  the  wave  number  k  of  equation  (  ).  The  magnitude  of  the  wave  vector  is  simply  that  of 

the  wave  number,  \k\  =  k  =  2nl  A,  while  its  direction  represents  the  direction  the  (traveling)  wave  is  moving 
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v/c 


Figure  15.2:  Velocity,  as  measured  in  an  inertial  rest  frame,  for  a  particle  that  undergoes  constant  acceleration,  according  to  classical 
mechanics  (orange)  and  special  relativity  (blue). 


in.  When  written  like  equation  (  ),  you  may  guess  that  there  exists  a  wave  four-vector  combining  the 

temporal  and  spatial  properties  of  the  wave,  and  you  would  be  correct.  If  we  define 

k  =  [o)lc,  tc),  (15.12) 

then  the  argument  of  the  cosine  in  equation  (  ),  i.e.,  the  phase  <p(x,  t)  of  the  wave  at  the  given  point  in 

space  and  time,  is  given  by 

(p(x,  t)  =  k  x- cot  - -(k-x).  (15.13) 

We’ve  already  shown  that  dot  products  of  two  four-vectors  are  invariant  under  Lorentz  transformations;  as  <p 
is  a  scalar  (and  thus  invariant)  and  x  a  four-vector,  it  follows  that  1c  is  indeed  also  a  four-vector. 

The  main  application  of  relativistic  waves  is  light  itself  -  in  its  occurrence  as  a  wave.  The  wave  four- vector 
of  a  light  beam  traveling  in  the  positive  x-direction  is  given  by 

k=(k,k,  0,0),  (15.14) 


where  we  used  that  for  light,  co-ck  (see  equation  ).  Unsurprisingly,  this  looks  exactly  like  equation  (  ) 

for  the  four-momentum  of  a  photon  -  especially  because  that  the  energy  of  a  photon  is  E  —  hc/X  -  hck/2n. 
Up  to  a  physical  constant,  the  wave  and  momentum  four  vector  of  light  are  thus  identical: 


Pphoton 


~2n&' 


’photon 


=  hlc. 


photon- 


(15.15) 


The  combination  Ii/2ti  occurs  so  often  that  it  got  its  own  symbol,  h  {‘h- bar’).  Note  that  equation  (  )  holds 

for  light  only. 

You  might  expect  that  there  is  little  more  to  say  about  light.  After  all,  the  light  postulate  ensures  that  the 
speed  of  light  will  be  the  same  for  all  observers.  Yet,  different  observers  can  observe  the  same  ray  of  light 
(or  the  same  photon)  differently:  although  its  speed  is  invariant,  its  frequency  (and  thus  its  color,  as  well  as 
its  momentum)  is  not!  To  see  what  happens,  let  us  start  with  a  stationary  light  source  emitting  rays  in  the 
positive  x-direction  in  some  system  S,  so  the  wave  four- vector  is  given  by  equation  (  ).  We  now  Lorentz  - 

transform  to  a  system  S'  moving  with  speed  u  in  the  x  direction.  The  wave  four-vector  as  measured  by  an 
observer  in  S'  is  simply  the  Lorentz  transform  of  (  ): 
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(15.16) 


We  find  that  the  moving  observer  still  sees  the  light  moving  in  the  positive  x  direction  with  speed  c,  but  with 
a  different  wave  number  /c',  and  thus  a  different  frequency  to'  =  ck\  given  by 


a»'  =  y(K)(l- “)w  =  ^ 


ulc 


+  ulc 


-u>. 


(15.17) 
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Equation  (  )  gives  the  relativistic  Doppler  effect:  a  shift  in  observed  frequency  due  to  the  motion  of  the 

observer,  just  as  we  found  for  sound  in  section  9  .  In  fact,  equation  (  17)  reduces  to  (  )  for  small  veloci¬ 

ties  u«c.  In  addition  to  the  ‘sound  effect’  where  we  account  for  the  stretching  or  compression  of  the  waves 
due  to  the  motion  of  the  observer,  the  relativistic  Doppler  effect  also  accounts  for  the  time  dilation  between 
the  two  observers  (it  can  also  be  derived  by  combining  these  two  effects,  as  is  done  in  many  textbooks,  see 
problem  3.2).  Unlike  for  sound,  there  is  also  a  transverse  relativistic  Doppler  effect  (entirely  due  to  the 
time  dilation),  for  which  we  can  find  the  expression  by  replacing  the  ray  traveling  in  the  positive  x  direction 
in  equation  (  16)  with  one  traveling  in  the  positive  y  direction. 
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15.4.  Problems 

15.1  Two  spaceships  are  connected  by  a  strong  cable.  Both  ships  are  initially  at  rest,  and  ignite  their  engines 
at  the  same  time.  Their  accelerations  are  identical  at  every  point  in  time.  Does  the  cable  stay  intact? 
Hint :  use  a  spacetime  diagram  to  analyze  what  happens. 

15.2  In  classical  mechanics,  the  Doppler  shift  in  the  wavelength  of  an  object  moving  towards  the  observer  at 
speed  u  is  given  by  (equation  9.19): 


v 

where  v  is  the  speed  of  the  wave  (usually  sound).  To  get  this  result,  we  compared  what  happens  with 
the  source  and  the  wave  in  a  fixed  time  interval  At.  As  you  now  know,  this  result  cannot  hold  at  speeds 
close  to  that  of  light,  because  in  that  case  there  will  be  a  significant  effect  due  to  time  dilation.  In  this 
problem,  we’ll  therefore  redo  the  calculation  to  account  for  relativistic  effects. 

We  consider  a  distant  source  of  light  that  moves  with  velocity  it.  At  time  t  —  0  (for  both  the  source  and 
the  stationary  observer),  the  source  emits  a  signal  (this  could  be  a  wavecrest,  but  the  argument  holds 
for  any  signal).  A  time  A t'  later,  as  measured  on  the  clock  moving  with  the  source,  the  source  emits  a 
second  signal,  see  figure  5.3. 


Figure  15.3:  Setting  for  calculating  the  relativistic  Doppler  effect.  As  the  (stationary)  observer  is  distant,  the  signals  traveling  to  the 
observer  from  the  source  can  be  taken  parallel  to  each  other;  they  make  an  angle  6  with  the  direction  in  which  the  source  moves.  Note 
that  both  the  distance  between  the  position  of  the  source  at  the  emission  of  the  first  and  second  signal  and  the  difference  in  distance 
traveled  by  the  signal  matter. 


(a)  Determine  the  time  interval  Af  between  the  emission  of  the  first  and  second  signal  as  measured  on 
the  clock  of  the  stationary  observer. 

(b)  Determine  the  change  in  distance  Ax  between  the  (stationary)  observer  and  the  (moving)  source  in 
the  time  interval  between  the  two  signals,  as  measured  by  the  stationary  observer. 

(c)  Now  determine  the  time  interval  At0bS  between  the  arrival  of  the  first  and  second  signal  at  the  loca¬ 
tion  of  the  observer. 


(d) 


From  your  answers  at  (a-c),  show  that  the  observed  frequency  v0bS  is  related  to  the  source’s  fre¬ 
quency  vs  through 


V 1  -  u2  /  c2 

vobs  —  .  .  rr  77  vs. 

1  +  (zi/c)  cost) 


(15.18) 


Note  that  equation  (15.18)  reduces  to  (  5.17)  in  the  case  that  the  source  is  moving  radially  away  from 
the  observer. 


(e)  From  equation  (  18),  find  the  expression  for  the  (relativistic)  transverse  Doppler  shift  for  the  case 

that  the  source  is  moving  in  a  direction  perpendicular  to  the  observer’s  line  of  sight  (i.e.,  6  —  90°). 
How  can  you  tell  that  in  this  case  the  Doppler  shift  is  exclusively  due  to  time  dilation? 
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A.l.  Vector  basics 

Classical  mechanics  describes  the  motion  of  bodies  as  they  move  through  space.  To  describe  a  motion  in 
space  it  is  not  sufficient  to  give  a  position  and  a  speed:  you  need  a  direction  as  well.  Therefore  we  work  with 
vectors:  mathematical  objects  that  have  both  a  magnitude  and  a  direction.  If  you  tell  me  you’re  moving,  I 
know  something,  but  not  much;  I’ll  know  more  if  you  tell  me  you’re  moving  at  walking  speed,  and  have  full 
information  of  your  velocity  once  you  tell  me  that  you’re  moving  at  walking  speed  towards  the  coffee  machine. 
Although  in  principle  we  could  make  do  with  specifying  a  magnitude  and  direction  of  every  vector  in  this  way, 
it  is  often  more  convenient  to  express  our  vectors  in  a  basis.  To  do  so,  we  choose  an  (arbitrary)  origin,  and 
as  many  basis  vectors  as  we  have  spatial  dimensions,  in  such  a  way  that  they  are  not  parallel  to  one  another, 
and  usually  mutually  perpendicular  (orthogonal)  and  of  unit  length  (orthonormal).  Then  we  decompose  our 
vector  by  giving  its  components  along  each  of  the  basis  vectors.  The  most  common  choice  is  to  use  a  Cartesian 
basis,  of  two  or  three  (depending  on  spatial  dimension)  basis  vectors  of  unit  length  pointing  in  the  standard 
x,  y  and  z  directions,  and  indicated  as  x,  y  and  z,  or  (rather  annoyingly)  sometimes  as  i,  j  and  k,  the  latter 
especially  in  American  textbooks.  Other  often  encountered  systems  are  polar  coordinates  (2D)  and  cylindrical 
and  spherical  coordinates  (3D),  see  the  mathematical  appendix  for  more  background  on  those.  To  write  our 
vectors,  we  now  specify  the  components  in  each  direction,  writing  for  example  v  =  3x  +  3y  for  a  vector  (in 
boldface)  representing  a  speed  of  3 C2  and  a  direction  making  an  angle  of  45°  with  the  horizontal. 

Vectors  can  be  added  and  subtracted  just  like  scalars  -  simply  add  and  subtract  them  by  component. 
Graphically,  you  add  two  vectors  by  putting  them  head-to-tail:  you  can  find  the  sum  of  two  vectors  v  and  w 
by  putting  the  start  of  w  at  the  end  of  v,  the  sum  v+w  then  points  from  the  start  of  v  to  the  end  of  w.  You  can 
also  multiply  a  vector  by  a  scalar,  by  multiplying  every  component  of  the  vector  with  that  scalar.  Graphically, 
this  means  that  you  extend  the  length  of  the  vector  with  the  scalar  factor  you  just  multiplied  with. 

You  can’t  take  the  product  of  two  vectors  like  you  would  two  scalars.  There  are  however  two  vector  op¬ 
erations  that  closely  resemble  the  product,  known  as  the  inner  (or  dot)  and  outer  (or  cross)  product,  see 
figure  .  The  dot  product  represents  the  length  of  the  projection  of  one  vector  on  another  (and  thus  gives 
a  scalar);  it  is  zero  for  perpendicular  vectors,  and  the  dot  product  of  a  vector  with  itself  gives  the  square  of  its 
length.  To  calculate  the  dot  product  of  two  vectors,  sum  the  products  of  their  components:  if  v  -  vxx  +  vyy 
and  w  =  wxx  +  wyy,  then  v ■  w  =  vxwx  +  vywy.  You  can  use  the  dot  product  to  find  the  angle  between  two 
vectors,  using  standard  geometry,  which  gives 


cosd  - 


v-  w 

\v\\w\ 


VXWX+  VyWy 

\v\\w\ 


(A.l) 


where  \  v\  and  \  w\  are  the  lengths  of  vectors  v  and  w,  respectively.  The  cross  product  is  only  defined  for 
three-dimensional  vectors,  say  v  =  vxx  +  vyy  +  vzz  and  w  =  wxx+  wyy+  wzz.  The  result  is  another  vector, 
with  a  direction  perpendicular  to  the  plane  spanned  by  v  and  w,  and  a  magnitude  equal  to  the  area  of  the 
parallelogram  bounded  by  them.  The  cross  product  is  most  easily  expressed  in  column  vector  form: 
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Figure  A.1:  Products  and  derivatives  of  vectors,  (a)  The  dot  product  of  two  vectors  a  and  b  gives  the  length  of  the  projection  of  a  on 
b  (or  vice  versa),  (b)  The  cross  product  of  two  vectors  a  and  b  gives  a  vector  perpendicular  to  the  plane  spanned  by  a  and  b  and  with 
a  magnitude  equal  to  the  area  of  the  parallelogram  spanned  by  a  and  b.  (c-f)  Acceleration  as  a  change  in  velocity.  In  every  panel,  the 
original  velocity  is  shown  in  black,  the  velocity  a  short  time  A  t  later  is  shown  in  blue,  and  the  acceleration  is  shown  in  red.  (c)  Acceleration 
for  an  increase  in  the  magnitude  of  the  velocity,  (d)  Acceleration  for  a  decrease  in  the  magnitude  of  the  velocity,  (e)  Acceleration  for  a 
change  in  direction  of  the  velocity  at  constant  magnitude,  (f)  Acceleration  for  a  change  in  velocity  that  involves  both  a  change  in  direction 
and  a  decrease  in  magnitude. 


The  cross  product  of  a  vector  with  itself  is  zero. 

Vectors  can  be  functions,  just  like  scalar  quantities:  they  can  depend  on  one  or  more  parameters,  like  po¬ 
sition  or  time.  Also,  again  just  like  scalar  functions,  you  can  calculate  a  rate  of  change  of  vector  function  as  you 
move  through  parameter  values,  for  instance  asking  how  the  velocity  of  a  car  changes  as  a  function  of  time. 
An  instantaneous  rate  of  change  is  simply  a  derivative,  which  is  calculated  in  exactly  the  same  manner  as  the 
derivative  of  a  scalar  function.  For  example,  the  rate  of  change  of  the  velocity,  known  as  the  acceleration  a,  is 
defined  as: 


v[t  +  At)  -  v 

a-  lim  - 

At— 0  A  t 


(A.3) 


Since  the  velocity  itself  is  the  derivative  of  the  position  x(f),  the  acceleration  is  also  the  second  derivative  of 
the  position.  Time  derivatives  occur  so  frequently  in  classical  mechanics  that  we  use  a  special  notation  for 
them:  a  first  derivative  is  indicated  by  a  dot  on  top  of  the  quantity,  and  a  second  derivative  by  a  double  dot  - 
so  we  have  a  =  v  —  x. 

Vector  derivatives  are  somewhat  richer  than  those  of  scalar  functions,  since  there  are  more  ways  that  a 
vector  can  change.  Like  a  scalar  function,  the  magnitude  of  a  vector  can  increase  or  decrease.  Moreover,  its 
direction  can  also  change,  which  also  means  that  it  has  a  nonzero  derivative,  and  of  course,  you  can  have  a 
combination  of  a  change  in  magnitude  and  a  change  in  direction,  see  figure 
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Figure  A.2:  Examples  of  scalar  and  vector  fields,  (a)  Temperature  (a  scalar)  at  every  point  in  the  Netherlands  at  11:30  on  October  28, 2013. 
Lines  of  equal  temperature  are  drawn  in  black;  the  gradient  of  the  temperature  field  is  everywhere  perpendicular  to  these  lines,  (b)  Wind 
velocity  (a  vector)  at  every  point  in  the  Netherlands  at  the  same  time  as  the  temperature  map.  Colors  correspond  to  the  magnitude  of 
the  velocity,  (a  and  b)  from  [30].  (c)  Example  of  a  vector  field  with  a  large  divergence  and  zero  curl,  (d)  Example  of  a  vector  field  with  a 
large  curl  (and  low  but  nonzero  divergence). 
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Functions  (scalar  or  vector)  that  are  defined  at  every  point  in  space  are  sometimes  called  fields.  Examples 
are  the  temperature  (scalar)  and  wind  (vector)  at  every  point  on  the  planet,  see  figure  A.2.  Just  like  you  can 
calculate  the  rate  of  change  of  a  function  in  time,  you  can  also  consider  how  a  function  changes  in  space.  For 
a  scalar  function,  this  quantity  is  a  vector,  known  as  the  gradient,  defined  as  the  vector  of  partial  derivatives. 
For  a  function  f{x,y,z),  we  have: 


V/  = 


d/„  df\  df\ 
—x+  —y+  —z 
dx  dy  dz 


dfldx  I 
dfldy  . 
dfldz  ) 


(A.4) 


The  direction  of  V/  is  the  direction  of  maximal  change,  and  its  magnitude  tells  you  how  quickly  the  function 
changes  in  that  direction.  For  a  vector  field  v,  we  can’t  take  the  gradient,  but  we  can  use  the  ‘vector’  V  of 
partial  derivatives  combined  with  either  the  dot  or  cross  product.  The  first  option  is  known  as  the  divergence 
of  v,  and  tells  you  how  quickly  v  spreads  out;  the  second  is  the  curl  of  v  and  tells  you  how  much  v  rotates: 


....  „  dvx  dvy  dvz 

div(i/)  =  V  •  v  — - + - + - , 

dx  dy  dz 


curl(ifi  -  V  x  v  - 


dyVz- 


dZVy 


dzVr-drVz 


dxi V 


dyV  x 


(A.5) 

(A.  6) 


where  dx  =  did x,  and  so  on. 

A.2.  Polar  coordinates 

You  can  specify  any  point  in  the  plane  by  specifying  its  projection  on  two  perpendicular  axes  -  we  typically 
call  these  the  x  and  y-axes  and  x  and  y  coordinates.  In  this  Cartesian  system  (named  after  Descartes),  we 
identify  unit  vectors  x  and  y,  pointing  along  their  respective  axes,  and  being  of  unit  length.  A  position  r  can 
then  be  decomposed  in  the  two  directions:  r  -  rxx  +  ryy,  with  rx  -  r  ■  x  and  ry  -  r  ■  y.  Alternatively,  we  can 

write  x  —  I  ^  ]  and  y  =  I  ^  J ,  which  gives  for  r : 


r  -  rxx+ryy-  |  ^  j. 

Instead  of  specifying  the  x  and  y  coordinates  of  our  position,  we  could  also  uniquely  identify  it  by  giving 
two  different  numbers:  its  distance  to  the  origin  r,  and  the  angle  9  the  line  to  the  origin  makes  with  a  fixed 
reference  axis  (typically  the  x-axis),  see  figure  .  Invoking  the  Pythagorean  theorem  and  basic  trigonometry, 
we  readily  find  r  -  \J rx  +  ry  and  tanO  =  rylrx.  We  call  r  the  length  of  the  vector  r.  We  could  also  invert  the 
relations  for  r  and  6  so  we  can  get  the  Cartesian  components  if  the  length  and  angle  are  known:  rx-r  cos  6 
and  r y  -  rsinO. 

Like  the  Cartesian  basis  vectors  x  and  y,  which  point  in  the  direction  of  increasing  x  and  y  values,  we 
can  also  define  unit  vectors  pointing  in  the  direction  of  increasing  r  and  9.  These  directions  do  depend  on 
our  position  in  space,  but  they  do  have  a  clear  geometrical  interpretation:  f  always  points  radially  outward 
from  the  origin,  and  0  in  the  direction  you’d  move  if  you’d  be  making  a  counterclockwise  rotation  about  the 
origin.  Given  a  position  vector  r,  finding  the  vector  in  the  direction  of  increasing  r  is  very  easy:  f  -  rlr.  The 
expression  for  r  in  our  new  polar  basis  (r,  6)  is  almost  tautological:  r  -  rr. 

Relating  the  polar  basis  vectors  to  the  Cartesian  ones  is  straightforward.  We  have: 


r=  rxx+ryy=  rr, 

and  using  rx-r  cos 9,  ry  -  r  sin (7  we  also  have 

r  —  rcos9x+  rsindy. 

We  thus  find  that  r  =  cos  Ox  +  sindy.  For  9  we  note  that  to  rotate  around  the  origin,  the  direction  of  motion 
needs  to  be  perpendicular  to  r.  There  are  of  course  two  such  directions  -  we  pick  the  sign  by  demanding  that 


160 


A.  Math 


Figure  A.3:  Polar  coordinate  system  and  polar  unit  vectors.  A  position  r  in  the  plane  can  be  specified  either  by  giving  its  projections  on 
two  reference  axes  (along  the  x  and  y  direction) ,  or  by  giving  its  distance  r  to  the  origin,  and  the  angle  8  the  line  to  the  origin  makes  with 
the  x  axis.  The  polar  basis  vectors  are  defined  as  pointing  in  the  direction  of  increasing  r  (i.e.,  radially  outward),  and  increasing  8  (i.e., 
rotating  counterclockwise  around  the  origin) . 


the  counterclockwise  rotation  is  positive.  This  gives  0  =  (ryl r)x  -  ( rxlr)y  =  sin  Ox  -  cos 9y.  Written  out  as 
vectors,  we  have: 


Note  that 


“1).  *=(  sln0J- 

sinfl  )  (  -  cost!  ) 

30  A  dr 

r  =  — ,  0  = - . 

30  38 


(A.7) 


(A.8) 


Naturally,  we  can  also  express  the  Cartesian  basis  in  terms  of  the  polar  ones: 


x  =  cosdr  +  sin00,  y  —  sindr  -  cosBO. 


(A.9) 


A.3.  Solving  differential  equations 

A  differential  equation  is  an  equation  which  contains  derivatives  of  the  function  to  be  determined.  They  can 
be  very  simple.  For  example,  you  may  be  given  the  (constant)  velocity  of  a  car,  which  is  the  derivative  of  its 
position,  which  wed  write  mathematically  as: 

dx 

v=-t~  =  vo-  (A.  10) 

df 

To  determine  where  the  car  ends  up  after  one  hour,  we  need  to  solve  this  differential  equation.  We  also  need 
a  second  piece  of  information:  where  the  car  was  at  some  reference  time  (usually  t  —  0),  the  initial  condition. 
If  x(0)  =  0,  you  don’t  need  advanced  maths  skills  to  figure  out  that  x(\  hour)  =  i>q  •  ( 1  hour).  Unfortunately, 
things  aren’t  usually  this  easy. 

Before  we  proceed  to  a  few  techniques  for  solving  differential  equations,  we  need  some  terminology.  The 
order  of  a  differential  equation  is  the  order  of  the  highest  derivative  found  in  the  equation;  equation  (  )  is 

thus  of  first  order.  A  differential  equation  is  called  ordinary  if  it  only  contains  derivatives  with  respect  to  one 
variable,  and  partial  if  it  contains  derivatives  to  multiple  variables.  The  equation  is  linear  if  it  does  not  contain 
any  products  of  (derivatives  of)  the  unknown  function.  Finally,  a  differential  equation  is  homogeneous  if  it 
only  contains  terms  that  contain  the  unknown  function,  and  inhomogeneous  if  it  also  contains  other  terms. 
Equation  (  )  is  ordinary  and  inhomogeneous,  as  the  vq  term  on  the  right  does  not  contain  the  unknown 

function  x(t).  In  the  sections  below,  we  discuss  the  various  cases  you’ll  encounter  in  this  book;  there  are  many 
others  (many  of  which  can’t  be  solved  explicitly)  to  which  a  whole  subfield  of  mathematics  is  dedicated. 

A.3.1.  First-order  linear  ordinary  differential  equations 

Suppose  we  have  a  general  equation  of  the  form 

a(*)4^  +  fctt)x(f)  =  /(f), 
at 


(A.11) 
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where  a{t),  b{t)  and  /(f)  are  known  functions  of  f,  and  x(f)  is  our  unknown  function.  Equation  (  )  is  a 

first-order,  ordinary,  linear,  inhomogeneous  differential  equation.  In  order  to  solve  it  we  will  use  two  tech¬ 
niques  that  are  tremendously  useful:  separation  of  variables  and  separation  into  homogeneous  and  particular 
solutions. 

Suppose  we  had  /(f)  =  0.  Then,  if  we  had  two  solutions  x  |  ( f )  and  x2  ( f)  of  equation  (  ),  we  could  con¬ 

struct  a  third  as  x\(t)  +  x2 ( f)  (or  any  linear  combination  of  x |  ( f)  and  x2(f)),  since  the  equation  is  linear.  Now 
since  /(f)  is  not  zero,  we  can’t  do  this,  but  we  can  do  something  else.  First,  we  find  the  most  general  solution 
to  the  equation  where  /(f)  =  0,  which  we  call  the  homogeneous  solution  X|,  ( t) .  Second,  we  find  a  solution 
(any  at  all)  of  the  full  equation  (A.l  ),  which  we  call  the  particular  solution  xp(f).  The  full  solution  is  then  the 
sum  of  these  two  solutions,  x(f)  =  xj,  ( f )  +  xp  ( f ) .  You  may  worry  that  there  may  be  multiple  particular  solu¬ 
tions:  how  would  we  pick  the  ‘right’  one?  Fortunately,  we  don’t  need  to  worry:  the  homogeneous  solution  will 
contain  an  unknown  variable,  which  will  be  set  by  the  initial  condition.  Changing  the  particular  solution  will 
change  the  value  of  the  variable,  such  that  the  final  solution  will  be  the  same  and  satisfy  both  the  differential 
equation  and  the  initial  condition. 

To  find  the  solution  to  the  homogeneous  equation 


dxv, 

a(t)——  +  i?(f)Xh(t)  =  0, 
df 


(A.  12) 


we’re  going  to  use  a  technique  called  separation  of  variables.  There  are  two  variables  in  this  system:  the 
independent  parameter  f  and  the  dependent  parameter  x.  The  trick  is  to  get  everything  depending  on  f  on 
one  side  of  the  equals  sign,  and  everything  depending  on  x  on  the  other.  To  do  so,  we’re  going  to  treat  dxl  df 
as  if  it  were  an  actual  fraction  .  In  that  case,  it’s  not  hard  to  see  that  we  can  re-arrange  equation  (  )  to 


1  b{t) 

—  dxh  = - df.  (A.13) 

Xh  a{t ) 


By  itself,  equation  (  )  means  little,  but  if  we  integrate  both  sides,  we  get  something  that  makes  sense: 


or 


bit) 


r  i  r 

I  —  dxh  =  log(xh)  +  C  =  -  /  —f  df 
J  xh  J  a{t ) 


Xh  ( f)  =  A  exp  - 


rb[ f) 

J  fl(f) 


df 


(A.  14) 


(A.  15) 


where  A  =  exp (C)  is  an  integration  constant  (the  unknown  constant  that  will  be  set  by  our  initial  condition). 
Of  course,  in  principle  it  may  not  be  possible  to  evaluate  the  integral  in  equation  (A.  15),  but  even  then  the 
solution  is  valid.  In  practice,  you’ll  often  encounter  situations  in  which  a(  t)  and  b(  t)  are  simple  functions  or 
even  constants,  and  the  evaluation  of  the  integral  is  straightforward. 

Now  that  we  have  our  homogeneous  solution,  we  still  need  a  particular  one.  Sometimes  you’re  lucky,  and 
you  can  easily  guess  one  -  for  instance  one  in  which  xp(f)  doesn’t  depend  on  f  at  all.  In  case  you’re  not  lucky, 
there’s  are  two  other  techniques  you  may  try,  either  using  variation  of  constants  or  finding  an  integrating 
factor.  To  demonstrate  variation  of  constants,  we’ll  pick  a  specific  example,  to  not  get  lost  in  a  bunch  of  ab¬ 
stract  functions.  Let  a{t)~  a  be  a  constant  and  b{t)  —  bt  be  linear.  The  homogeneous  solution  then  becomes 
Xh(f)  =  A  exp  [-  j  |  f2] .  The  constant  we’re  going  to  vary  is  our  integration  constant  A,  so  our  guess  for  the 
particular  solution  will  be 


xp(f)  =  A(f)  exp 


(A.  16) 


We  substitute  (  )  back  into  the  full  differential  equation  (  ),  which  gives: 


dA  bt 

1  by 

d  A 

1  by 

a - flA(f) - F  hfA(f) 

df  a 

exp 

- f2 

2  a 

=  a —  exp 
df 

- f2 

2  a 

(A.  17) 


1  If  you’re  worried  about  the  validity  of  this  shortcut,  consider  that  we  could  also  have  left  dxj,/  df  intact,  to  give  -4-  d  %/  df  =  —  h(f)/o(f). 
Integrating  both  sides  over  time  gives 


[  —  dxhldtdt  =  f  —  dxh  =  -  [  —  df, 

J  *h  J  xh  h  J  a(t) 


which  takes  us  back  to  equation  (A.  14). 
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A  big  part  of  the  left-hand  side  thus  cancels,  and  that’s  not  a  coincidence  -  that’s  because  it  is  based  on  the 
homogeneous  equation.  What  remains  is  a  differential  equation  in  Ait)  that  can  be  trivially  solved  by  direct 
integration: 

r  dA  1  r 

Ait)  =  I  —  df=-  I  /(f) exp 
J  at  aj 


1  b  ? 
--r 

2  a 


dt. 


(A.  18) 


Again,  it  may  not  be  possible  to  evaluate  the  integral  in  equation  (A.  18),  but  in  principle  the  solution  could 
be  inserted  in  equation  (  )  to  give  us  our  particular  solution,  and  the  whole  differential  equation  will  be 

solved. 

Alternatively,  we  may  try  to  find  an  integration  factor  for  equation  (  ).  This  means  that  we  try  to  re¬ 

write  the  left  hand  side  of  the  equation  as  a  total  derivative,  after  which  we  can  simply  integrate  to  get  the 
solution.  To  do  so,  we  first  divide  the  whole  equation  by  ait),  then  look  for  a  function  nit)  that  satisfies  the 
condition  that 

d  r  .  dx  du  dx  bit) 

—  p(f)x(f)  =  p(f)  —  +  x(f)  —  =  p(f)  —  +  p(f)— -xit),  (A.  19) 

dt 1  1  dt  dt  dt  ait) 

from  which  we  can  read  off  that  we  need  to  solve  the  homogeneous  equation 


d/i 

df 


b{t) 
ait ) 


Hit). 


We  can  solve  (A.20)  by  separation  of  constants,  which  gives  us 


flit)  -  exp 


(A.20) 


(A.21) 


where  we  set  the  integration  constant  to  one,  as  it  drops  out  of  the  equation  for  xit)  anyway.  With  this  func¬ 
tion  ni f).  we  can  rewrite  equation  (A.  )  as 


which  we  can  integrate  to  find  xit): 


d  ,  ,  fit) 

i  r  fw 

xit)  =  —  /  nit)- —  dt. 
nit)  ait) 


(A.22) 


(A.23) 


A.3.2.  Second- order  linear  ordinary  differential  equations  with  con¬ 
stant  COEFFICIENTS 

Second  order  ordinary  differential  equations  are  essential  for  the  study  of  mechanics,  as  its  central  equation, 
Newton’s  second  law  of  motion  (equation  2.  )  is  of  this  type.  In  the  case  that  the  equation  is  also  linear,  we 
have  some  hopes  of  solving  it  analytically.  There  are  several  examples  of  this  type  of  equation  in  the  main 
text,  especially  in  section  2:  ,  where  we  solve  the  equation  of  motion  resulting  from  Newton’s  second  law  for 
three  special  cases,  and  section  8  ,  where  we  study  a  number  of  variants  of  the  harmonic  oscillator. 

For  the  case  that  the  equation  is  homogeneous  and  has  constant  coefficients,  we  can  write  down  the 
general  solution  .  The  equation  to  be  solved  is  of  the  form 


a 


d2x  ,  dx 

- 77  T"  b - 

dt2  df 


+  exit)  -  0. 


(A.24) 


For  the  case  that  a  —  0,  we  retrieve  a  first-order  differential  equation,  whose  solution  is  an  exponential  (as  can 
be  found  by  separation  ofvariables  and  integration) :  x(f)  =  Cexpictlb).  In  many  cases  an  exponential  is  also 
a  solution  of  equation  (  .24).  To  figure  out  which  exponential,  let’s  start  with  the  trial  function  (or  Ansatz’) 
xit)  -  exp(Af),  where  A  is  an  unknown  parameter.  Substituting  this  Ansatz  into  equation  (  ,2Z )  yields  the 
characteristic  polynomial  for  this  ode: 

aA2  +  bA  +  c  -  0,  (A.25) 

which  almost  always  has  two  solutions: 


b  Vb2-4ac 

—  i - 

2  a  2  a 


(A.  2  6) 


2  This  approach  generalizes  to  higher-order  differential  equations,  though  the  characteristic  polynomial  may  not  be  as  easy  to  solve  in 
the  general  case. 
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Note  that  the  solutions  can  be  real  or  complex.  If  there  are  two  of  them,  we  can  write  the  general  solution  of 
equation  ( \.24)  as  a  linear  combination  of  the  Ansatz  with  the  two  cases: 

x(f)  =  AeA+t  +  BeA-t,  (A.27) 


where  A  and  B  are  set  by  either  initial  or  boundary  conditions.  Since  the  A+  may  be  complex,  so  may  A  and 
B;  it’s  their  combination  that  should  give  a  real  number  (as  x(t)  is  real),  see  problem  A.3. la. 

In  the  case  that  equation  (  )  gives  only  one  solution,  the  corresponding  exponential  function  is  still 

a  solution  of  equation  (  2- ),  but  it  is  not  the  most  general  one,  as  we  only  can  put  a  single  undetermined 
constant  in  front  of  it.  We  therefore  need  a  second,  independent  solution.  To  guess  one,  here’s  a  third  useful 
trick  :  take  the  derivative  of  our  known  solution,  e  ,  with  respect  to  the  parameter  A.  This  gives  a  second 
Ansatz:  te ,  where  A  =  -bl2a.  Substituting  this  Ansatz  into  equation  ( \ .2-1 )  for  the  case  that  c  =  b2 12a,  we 
find: 


dzx  dx  bz 

+  b~r  + 
dr2  df  2  a 


1  b 

b2  \ 

-bt  (  b  \ 

--  + 

e  2a  +  bll - f 

V  a 

4a2  ) 

l  2  a  ) 

bt  b  _ 
2«  +  —  te 
4  a 


bt 

2«  =  0, 


(A.28) 


so  our  Ansatz  is  again  a  solution.  For  this  special  case,  the  general  solution  is  therefore  given  by 


bt  bt 

x(f)  -  Ae  2a  +  Bte  2a . 


(A.29) 


In  section  8.2,  where  we  discuss  the  damped  harmonic  oscillator,  the  special  case  corresponds  to  the  crit¬ 
ically  damped  oscillator.  We  get  an  underdamped  oscillator  when  the  roots  of  the  characteristic  polynomial 
are  complex,  and  an  overdamped  one  when  they  are  real. 


A.3.3.  Second-order  linear  ordinary  differential  equations  of  Euler 
TYPE 

There’s  a  second  class  of  linear  ordinary  differential  equations  that  we  can  solve  explicitly:  those  of  Euler  (or 
Cauchy-Euler)  type,  where  the  coefficient  in  front  of  a  derivative  contains  the  variable  to  the  power  of  the 
derivative,  i.e.,  for  a  second-order  differential  equation,  we  have  as  the  most  general  form: 


,  d2y  ,  dv 

ax  — -  +  bx - h  cy(x)  -  0. 

dx1  dx 


(A.30) 


Note  that  we  are  now  solving  for  y(x);  we  do  so  because  this  type  of  equation  typically  occurs  in  the  context 
of  position-  rather  than  time-dependent  functions.  An  example  is  the  Laplace  equation  (V2y  =  0)  in  polar 
coordinates.  Like  for  the  second  order  ode  with  constant  coefficients,  the  ode  of  Euler  type  can  be  generalized 
to  higher-order  equations. 

There  are  (at  least)  two  ways  to  solve  equation  (  .3( ):  through  an  Ansatz,  and  through  a  change  of  vari¬ 
ables.  For  the  Ansatz,  note  that  for  any  polynomial,  the  derivative  of  each  term  reduces  the  power  by  one, 
and  here  we’re  multiplying  each  such  term  with  the  variable  to  the  power  the  number  of  derivatives  .  This 
suggests  we  simply  try  a  polynomial,  so  our  Ansatz  here  will  be  y(x)  =  x”.  Substituting  in  equation  (  ) 

gives: 

ax2n(n  -  l)x"-2  +  bxnxn~l  +  ex11  =  [an(n  -  1)  +  bn  +  c]x"  =  0,  (A.31) 

so  we  get  another  second  order  polynomial  to  solve,  this  time  in  n: 


an2  +  [b-  a)n+  c  -  0 


b 

2  a 


1 

2 


Via  -  b)2  -  4 ac. 


(A.32) 


If  the  roots  in  equation  (A.32)  are  both  real  (the  most  common  case  in  physics  problems),  we  have  two  in¬ 
dependent  solutions,  and  we  are  done.  If  the  roots  are  complex,  we  also  have  two  independent  solutions, 

3You  may  wonder  how  we  know  that  this  is  the  most  general  solution  of  equation  [  1).  The  argument  follows  from  the  proof  of  the 

Picard  Lindelof  theorem,  which  states  that  under  fairly  weak  conditions  on  the  ode  (which  are  met  by  equation  (A.24),  there  exists  a 
single  unique  solution,  which  depends  on  as  many  free  parameters  as  the  order  of  the  equation.  For  example,  if  you  know  the  position 
and  its  derivative  at  t  =  0,  everything  else  is  fixed:  equation  (A.24)  gives  you  the  value  of  the  second  derivative,  from  which  you  can 
calculate  the  value  of  the  derivative  and  the  function  itself  at  some  later  time  A  t,  and  so  on.  As  equation  (A.27)  has  two  unknowns, 
we’ve  used  up  all  our  degrees  of  freedom  in  determining  them,  and  as  the  solution  is  unique,  it  cannot  contain  another  term. 
4Alternatively,  you  could  use  a  variant  of  variation  of  constants,  known  as  reduction  of  order;  see  section  A.3.4  and  problem  A.3.2. 

5An  admittedly  complicated  sentence,  but  if  you  find  yourself  wondering  about  it,  just  press  on,  and  return  later,  when  it  will  hopefully 
be  obvious. 
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though  they  involve  complex  powers  of  x;  like  for  the  equation  with  constant  coefficients,  we  can  rewrite 
these  as  real  functions  with  Euler’s  formula  (see  problem  A.3.1b).  For  the  case  that  we  have  only  one  root, 
we  again  apply  our  trick  to  get  a  second:  we  try  dxn/d n  =  x"  ln(x),  which  turns  out  to  be  indeed  a  solution 
(problem  ,1c),  and  the  general  solution  is  again  a  linear  combination  of  the  two  solutions  found. 

Alternatively,  we  could  have  solved  equation  (A.30)  by  a  change  of  variables.  Although  this  method  is 
occasionally  useful  (and  so  it’s  good  to  be  aware  of  its  existence) ,  there  is  no  systematic  way  of  deriving  which 
change  of  variables  will  do  the  trick,  so  you’ll  have  to  go  by  trial-and-error  (without  a  priori  guarantee  of 
success).  In  this  case,  this  process  leads  to  the  following  substitution: 

x=et,  y(x)  -  y[er)  =  (pit),  (A.33) 

where  we  introduce  (pi  t)  for  convenience.  Taking  derivatives  of  y(x)  with  the  chain  rule  gives 

dy  dy  dt  1  dcp  d2y  1  ( d2cp  d<M 

dx  dt  dx  x  dt’  dx2  x2  [  dt2  df  J  ’  ^  ^ 

where  we  kept  the  1/x  and  1/x2  as  they’ll  cancel  the  coefficients  in  equation  (  ).  Substituting  (  ) 

and  (  ,30  in  equation  (A30),  we  get 


d2<p  dip 

a — tt  +  [b-  a)-2-  +  c(b{t)  -  0, 
df 2  df 


(A.35) 


which  is  a  second  order  differential  equation  with  constant  coefficients,  and  thus  of  the  form  given  in  equa¬ 
tion  (  .2  ).  We  therefore  know  how  to  find  its  solutions,  and  can  use  equation  (A3 2)  to  transform  those 
solutions  back  to  functions  y(x). 


A.3.4.  CTION  OF  ORDER 

If  you  find  yourself  with  a  non-homogeneous  second  order  differential  equation  where  the  homogeneous 
equation  either  has  constant  coefficients  or  is  of  Euler  type,  you  can  again  use  the  technique  of  variation  of 
constants  to  find  a  particular  solution.  A  similar  technique,  known  as  reduction  of  order,  may  help  you  find 
solutions  to  a  second  (or  higher)  order  equation  where  the  coefficients  are  not  constant.  In  order  to  be  able 
to  use  this  technique,  you  need  to  know  a  solution  to  the  homogeneous  equation,  so  it  is  not  as  universally 
applicable  as  the  techniques  in  the  previous  two  sections,  but  still  frequently  very  helpful. 

Eet  us  write  the  general  non-homogeneous  second-order  linear  differential  equation  as 


d2y 

dx2 


+  pit ) 


dy 

dx 


+  q[x)y[x)  =  r(x). 


(A.36) 


Note  that  this  is  the  most  general  form:  if  there  is  a  coefficient  (constant  or  otherwise)  in  front  of  the  second 
derivative,  we  simply  divide  the  whole  equation  by  that  coefficient  and  redefine  the  coefficients  to  match 
equation  (  ).  Now  suppose  we  have  a  solution  yi(x)  of  the  homogeneous  equation  (so  for  the  case  that 

r(x)  =  0).  As  the  equation  is  homogeneous,  for  any  constant  v  the  function  vy\  (x)  will  also  be  a  solution.  As 
an  Ansatz  for  the  second  solution,  we’ll  try  a  variant  of  variation  of  constants,  and  take 


y2(x)  =  f(x)yi  (x), 


(A.37) 


where  v{x)  is  an  arbitrary  function.  Substituting  (A.37)  back  into  (A.36),  we  find 


d2  v 

''“s’ 


2Ai 

dx 


+  p(x)yi  (x) 


dv 
—  + 
dx 


d2yi 

dx2 


+  p(x) 


dyi 

dx 


+  g(x)yi  (x) 


v[x )  =  r(x). 


(A.38) 


We  recognize  the  prefactor  of  v{x )  as  exactly  the  homogeneous  equation,  which  yi(x)  satisfies,  so  this  term 
vanishes.  Now  defining  w (x)  =  d vl  dx,  we  are  left  with  a  first-order  equation  for  w{x): 


d  w 

yiix)—  + 

dx 


dvi 

2——  +  p(x)yi  (x) 
dx 


w{x)  =  r(x). 


(A.39) 


Equation  (A.39)  is  a  first-order  linear  differential  equation,  and  can  be  solved  by  the  techniques  from  sec¬ 
tion  (  ).  Integrating  the  equation  w(x)  =  dvi  dx  then  gives  us  v(x),  and  hence  the  second  solution  (  t.37) 

of  the  (inhomogeneous)  second  order  differential  equation. 
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A.3.5.  Power  series  solutions 

If  none  of  the  techniques  in  the  sections  above  apply  to  your  differential  equation,  there’s  one  last  Ansatz 
you  can  try:  a  power  series  expansion  of  your  solution.  To  illustrate,  we’ll  again  pick  a  concrete  example: 
Legendre’s  differential  equation,  given  by 


d 

dx 


dx 


+  n(n  +  l)y(x)  =  (1  - 


d2y 


,d  y 


x  )  — 7;  -  2x - F  n[n  +  l)y(x)  =  0, 

dx2  dx 


(A.40) 


where  n  is  an  integer.  As  an  Ansatz  for  the  solution,  we’ll  try  a  power  series  expansion  of  y(x): 

OO 

y(x)  =  E  akxk.  (A.41) 

k= o 


Our  task  is  now  to  find  numbers  ak  (many  of  which  may  be  zero)  such  that  (  )  is  a  solution  of  (  40). 

Fortunately,  we  can  simply  substitute  our  trial  solution  and  re-arrange  to  get 


0=  (1-x2) 


dx2 

OO 


r  OO  ' 

d 

r  OO  ^ 

E  akxk 
U=0 

-  2x — 
dx 

E  akxk 

,k=0 

Ar=l 


^k-0 


\ 


=  (l-x2)  ^k{k-\)akxk  2|-2x  '^jkakxk  1  +  n{n  +  1)  E  akxk 


\k=  o 


=  E  \ {-k[k  —  1)  - 2k+  n[n+  1))  akxk  +  k{k- \)akx 
k= 0 


=  E  [(-fc(fc  +  1)  +  n{n+  1))  ak  +  [k  +  2){k+  l)ajfc+2]  xk, 
k= 0 


/  A;=l 

A:— 2 1 


(A.42a) 

(A.42b) 

(A.42c) 

(A.42d) 


where  in  the  last  line,  we  ‘shifted’  the  index  of  the  last  term  .  We  do  so  in  order  to  get  at  an  expression  for  the 
coefficient  of  xl  for  any  value  of  k.  As  the  functions  xk  are  linearly  independent  (i.e.,  you  can’t  write  xk  as  a 
linear  combination  of  other  functions  xm  where  m  y  k),  the  coefficient  of  each  of  the  powers  in  the  sum  in 
equation  (A.42d)  has  to  vanish  for  the  sum  to  be  identically  zero.  This  gives  us  a  recurrence  relation  between 
the  coefficients  ak: 


ak+ 2  - 


k{k+  1)  -  n{n+  1) 

(fc  +  2)(k  +  1) 


ak ■ 


(A.43) 


Given  the  values  of  ao  and  a\  (the  two  degrees  of  freedom  that  our  second-order  differential  equation  allows 
us),  we  can  repeatedly  apply  equation  (  )  to  get  all  coefficients.  Note  that  for  k-n  the  coefficient  equals 

zero.  Therefore,  if  for  an  even  value  of  n,  we  set  a\  =  0,  and  for  an  odd  value  of  n,  we  set  Uq  -  0,  we  get  a  finite 
number  of  nonzero  coefficients.  The  resulting  solutions  are  polynomials,  characterized  by  the  number  n;  in 
this  case,  they’re  known  as  the  Legendre  polynomials,  typically  denoted  Pn{x),  and  normalized  (by  setting  the 
value  of  the  remaining  free  coefficient)  such  that  Pn{  1)  =  1.  Table  lists  the  first  five,  which  are  also  plotted 
in  figure  A.  a. 

Legendre  polynomials  have  many  other  interesting  properties  (many  of  which  can  be  found  in  either 
math  textbooks  or  on  their  Wikipedia  page).  They  occur  frequently  in  physics,  for  example  in  solving  prob¬ 
lems  involving  Newtonian  gravity  or  Laplace’s  equation  from  electrostatics. 

If  we  replace  the  n(n  +  1)  factor  in  the  Legendre  differential  equation  with  an  arbitrary  number  A,  the 
series  solution  remains  a  solution,  but  it  no  longer  terminates  .  There  are  many  other  differential  equations 


®Note  that  we  can  do  this  because  the  index  A;  is  a  ‘dummy’,  i.e.,  an  index  that  we  sum  over  and  thus  does  not  appear  in  the  end  answer. 
We  could  just  as  well  have  called  it  j,  or  we  could  define  j  =  k  —  1,  and  re-write  the  sum  over  k  as 


OO  OO  OO 

Y,akxk=  E  aj+ 1*J+1=  E  aj+ixJ+1. 

k= 0  j=-l  j=- 1 


Therefore 


OO 

E  ka^x^-1 

k= 0 


OO  OO 

X  O'  +  1)  a j+1xJ  =  X  0  + 1) « j+i  • 
j=- i  j= 0 


where  in  the  last  equality  we  drop  the  term  with  j  =  - 1  as  it  is  identically  zero.  Applying  this  concept  twice  on  the  last  term  of  (A. 42c) 
and  then  relabeling  j  =  k  in  the  result  gives  us  equation  (A.42d). 

7  Or  ‘orthogonal’.  The  proof  of  this  property  of  the  polynomial  functions  can  be  found  in  many  analysis  and  linear  algebra  books. 
8Except,  of  course,  if  you  choose  A  =  n{n  +  1). 
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n 

PnW 

0 

1 

1 

X 

2 

§(3x-l) 

3 

|  (5x3  -  3x) 

4 

g  (35x4  -  30x2  +  3) 

5 

g  (63x5  -  70x3  +  15x) 

Table  A.  1 :  The  first  five  (and  zeroth,  for  good  measure)  Legendre  polynomials,  the  solutions  of  equation  A.4t  for  the  given  value  of  n  and 
the  appropriate  choice  of  ao  and  a\ . 


a 


PM) 


•  n=1 

•  n=2 

•  n=3 

•  n=4 

•  n=5 


Figure  A.4:  Solutions  to  the  Legendre  and  Bessel  differential  equations,  (a)  The  first  five  Legendre  polynomials  (table  A.l).  Note  that  the 
polynomials  with  even  n  are  all  even,  and  those  with  odd  n  are  all  odd.  (b-c)  The  first  five  Bessel  functions  of  the  first  (b)  and  second  (c) 
kind. 


that  lead  to  both  infinite  series  and  polynomial  solutions.  A  well-known  example  is  the  Bessel  differential 


equation: 


x 2 — j  +  x—  +  (x2  -  n2)y{x)  =  0. 
dxz  dx 


(A.44) 


The  solutions  to  this  equation  are  known  as  the  Bessel  functions  of  the  first  and  second  kind  (see  prob¬ 
lem  3.3,  where  you’ll  prove  that  for  these  functions  the  series  never  terminates).  These  functions  gener¬ 
alize  the  sine  and  cosine  function  and  occur  in  the  vibrations  of  two-dimensional  surfaces.  Other  examples 
include  the  ffermite  and  Laguerre  polynomials,  which  feature  in  quantum  mechanics,  and  Airy  functions, 
which  you  can  encounter  when  studying  optics. 
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A.3.6.  ROBLEMS 

A.3.1  (a)  Suppose  we  have  a  solution  of  equation  (  )  where  the  roots  A+  of  the  characteristic  polynomial 

(equation  \.2  )  are  complex,  so  A+  =  a  ±  i (i.  Rewrite  the  general  solution  (  .27)  in  real  functions 
with  real  coefficients  C  and  D,  and  express  C  and  D  in  terms  of  A  and  B.  Hint:  use  Euler’s  formula 
elx  =  cos(x)  +  i  sin(x). 

(b)  Suppose  we  have  a  solution  of  equation  (  32)  where  the  roots  n±  are  complex,  so  n+  =  a±  ip.  To 
get  a  solution  of  equation  (A.30)  without  complex  numbers,  we  make  the  substitution  x  =  e1,  so 

xn±  _  xa±iP  -  eate±i75t 

Use  Euler’s  formula  again  to  rewrite  the  complex  exponential  in  terms  of  sines  and  cosines,  and 
make  the  back-substitution  to  x  to  show  that  the  general  solution  of  equation  (A.30)  in  this  case  is 
given  by 

y(x )  =  xa  [Acos(/31n(x))  +  sin(/)ln(x))] .  (A.45) 

(c)  Suppose  we  have  a  solution  of  equation  (  .32)  for  which  there  is  only  a  single  root  n.  Show  that 
the  derivative  of  xn  with  respect  to  n  is  in  this  case  also  a  solution  of  equation  (  ..30),  and  that  the 
general  solution  is  given  by 

y  (x)  -  xn[A  +  B ln(x)] .  (A.46) 

A.3.2  Use  the  method  of  reduction  of  order  to  obtain  a  second  solution  of  equation  (  )  for  the  case  that 

the  characteristic  polynomial  (equation  l.26)  has  only  a  single  root. 

A.3.3  (a)  Use  the  power  series  technique  to  find  a  solution  to  Bessel’s  differential  equation  (  ).  Why 

doesn’t  the  series  terminate  in  this  case?  Why  do  you  only  get  one  family  of  solutions?  We’ll  call 
these  solutions  ‘Bessel  functions  of  the  hist  kind’  and  label  them  as  }n[x)  (see  figure  A.-  b). 

(b)  Use  the  method  of  reduction  of  order  to  find  a  second  family  of  solutions  to  the  Bessel  differential 
equation,  known  as  ‘Bessel  functions  of  the  second  kind’  (F„(x),  see  figure  c). 


B 

Some  equations  and  constants 

B.l.  Physical  constants 


Name 

Symbol 

Value 

Speed  of  light 

c 

3.00 -10“  m/s 

Elementary  charge 

e 

1.60  •  10-19  C 

Electron  mass 

me 

9.11  10"31  kg 

Proton  mass 

mp 

1.67  •  10-27  kg 

Gravitational  constant 

G 

6.67 -10"11  N-m2/kg2 

Gravitational  acceleration 

g 

9.81  m/s2 

Planck’s  constant 

h 

6.63- 10-34  J-s 

h  -  hi 2n 

1.05- 10"34 1- s 

Table  B.l:  Physical  constants 


B.2.  Moments  of  inertia 


Object 

Moment  of  inertia 

Thin  stick  (length  L) 

Ring  of  hollow  cylinder  (radius  R) 

Disk  or  solid  cylinder  (radius  R ) 

Hollow  sphere  (radius  R ) 

Solid  sphere  (radius  R ) 

Rectangle  (size  ax  b),  perpendicular  axis 
Rectangle  (size  ax  b),  axis  parallel  to  side  b 

A  ML2 

MR 2 

\mr2 

|  MR2 
|  MR2 

±M(a2  +  b2) 

±Ma2 

Table  B.2:  Moments  of  inertia,  all  about  axes  of  symmetry  through  the  center  of  mass. 
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B.3.  Solar  system  objects 


Sun 

Earth 

Moon 

Mass  (kg) 

1.99 -103(J 

5.97 -1024 

7.35 -1022 

Mean  radius  (m) 

6.96- 108 

6.37 -106 

1.74 -106 

Orbital  period  (s) 

6-1015 

.316- 107 

2.36 -106 

(200  My) 

(365.25  days) 

(27.3  days) 

Mean  orbital  radius  (m) 

2.6- 102° 

1.50- 1011 

3.85 -108 

Mean  density  (kg/m3) 

1.4- 103 

5.5 -103 

3.3  103 

Table  B.3:  Characteristics  of  the  Sun,  Earth  and  Moon. 


Name 

Symbol 

Equatorial 

radius 

Mass 

Mean  orbit 
radius 

Orbital  pe¬ 
riod 

Inclination 

Orbital  ec¬ 
centricity 

Rotation  pe¬ 
riod 

Confirmed 

moons 

Axial  tilt 

Mercury 

9 

0.382 

0.06 

0.39 

0.24 

3.38 

0.206 

58.64 

0 

0.04 

Venus 

9 

0.949 

0.82 

0.72 

0.62 

3.86 

0.007 

-243.02 

0 

177.36 

Earth 

6 

1 

1 

1 

1 

7.25 

0.017 

1 

1 

23.44 

Moon 

<L 

0.272 

0.0123 

384399 

27.32158 

18.29-28.58 

0.0549 

27.32158 

0 

6.68 

Mars 

<? 

0.532 

0.11 

1.52 

1.88 

5.65 

0.093 

1.03 

2 

25.19 

Ceres 

0.0742 

0.00016 

2.766 

4.599 

10.59 

0.08 

0.3781 

0 

4 

Jupiter 

% 

11.209 

317.8 

5.2 

11.86 

6.09 

0.048 

0.41 

69 

3.13 

Io 

0.285 

0.015 

421600 

1.769 

0.04 

0.0041 

1.769 

0 

0 

Europa 

0.246 

0.008 

670900 

3.551 

0.47 

0.009 

3.551 

0 

0 

Ganymede 

0.413 

0.025 

1070400 

7.155 

1.85 

0.0013 

7.155 

0 

0 

Callisto 

0.378 

0.018 

1882700 

16.689 

0.2 

0.0074 

16.689 

0 

0 

Saturn 

i) 

9.449 

95.2 

9.54 

29.46 

5.51 

0.054 

0.43 

62 

26.73 

Titan 

0.404 

0.023 

1221870 

15.945 

0.33 

0.0288 

15.945 

0 

0 

Uranus 

s 

4.007 

14.6 

19.22 

84.01 

6.48 

0.047 

-0.72 

27 

97.77 

Oberon 

0.119 

0.00051 

583519 

13.46 

0.1 

0.0014 

13.46 

0 

0 

Neptune 

s 

3.883 

17.2 

30.06 

164.8 

6.43 

0.009 

0.67 

14 

28.32 

Triton 

0.212 

0.00358 

354759 

5.877 

157 

0.00002 

5.877 

0 

0 

Pluto 

E 

0.186 

0.0022 

39.482 

247.9 

17.14 

0.25 

6.39 

5 

119.59 

Charon 

0.095 

0.00025 

17536 

6.387 

0.001 

0.0022 

6.387 

0 

unknown 

Haumea 

0.13 

0.0007 

43.335 

285.4 

28.19 

0.19 

0.167 

2 

unknown 

Make  make 

0.11 

unknown 

45.792 

309.9 

28.96 

0.16 

unknown 

1 

unknown 

Eris 

0.18 

0.0028 

67.668 

557 

44.19 

0.44 

unknown 

1 

unknown 

Table  B.4:  Properties  of  a  number  of  solar  system  objects.  Equatorial  radii  and  masses  are  compared  to  those  of  Earth  (see  table  B.3). 
Orbital  properties  are  around  primary  (the  sun  for  (dwarf)  planets,  the  planet  for  moons).  Orbital  radii  and  periods  for  planets  again 
compared  to  Earth,  for  moons  in  kilograms  and  days.  Rotation  period  for  all  objects  in  days.  Inclination  and  axial  tilt  in  degrees.  Data 
from  NASA  planetary  fact  sheets  [31]. 
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B.4.  Equat: 


B.4.1.  Vector  derivatives 


Gradient: 


V/(r)  =  V  f{x,y,z)  = 


dxf 

dyf 

dzf 


df\  df  „  df 
-f-x+  -r~y+ 
dx  dy  dz  ' 


(B.  1) 


Divergence: 


f 

V-v=[dx,dy,dz)- 


vx  1 

Vy 
Vz  j 


dvx  dvy  dvz 
dx  dy  dz 


(B.2) 


Curl: 


VxA=[dx,dy,dz)x 


Ax 

r  dyAz-dzAy  1 

Ay 

= 

dzAx  -  dxAz 

<  Az  j 

^  3XAy  —  dyAX  J 

(B.3) 


B.4.2.  Special  relativity 

Lorentz  transformations  for  the  coordinates  of  a  frame  S'  that  moves  with  a  speed  u  in  the  positive  x- 
direction  of  frame  S: 


x' 

=  Jiu)  I 

(  u  > 

X - ct 

ct' 

=  J{u)  I 

f  U  ' 

let - X 

Jill) 

1 

VT- 

-  ( ulc )2 

Velocity  addition  in  a  relativistic  system: 

u+  v' 

Vjc  —  ~  o 

1  +  uvx/c z 


1  v'y 

y(w)  1  +  uv'xlc2 


(B.4) 

(B.5) 

(B.6) 


(longitudinal) 


(transversal) . 


(B.7) 
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